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COMPOUNDS THAT INHIBIT INTERACTION BETWEEN SIGNAL-TRANSDUCING PROTEINS 
AND THE GLGF (PDZ/DHR) DOMAIN AND USES THEREOF 

The invention disclosed herein was made with Government 
support under Grant No. R01GM55147-01 from the National 
Institutes of Health of the United States Department of 
10 Health and Human Services. Accordingly, the U.S. 

Government has certain rights in this invention. 

BACKGROUND 

15 Throughout this application, various publications are 

referenced by author and date. Full citations for these 
publications may be found listed alphabetically at the 
end of the specification immediately preceding Sequence 
Listing and the claims. The disclosures of these 

20 publications in their entireties are hereby incorporated 

by reference into this application in order to more fully 
describe the state of the art as known to those skilled 
therein as of the date of the invention described and 
claimed herein. 

25 

Fas (APO-1/CD95) and its ligand have been identified as 
important signal -mediators of apoptosis (Itoh, et al . 
1991) The structural organization of Fas (APO-1/CD95) has 
suggested that it is a member of the tumor necrosis 

30 factor receptor superfamily, which also includes the p75 

nerve growth factor receptor (NGFR) (Johnson, et al. 
1986), the T-cell-activation marker CD27 (Camerini, et 
al. 1991), the Hodgkin-lymphoma-associated antigen CD30 
(Smith, et al. (1993), the human B cell antigen CD40 

35 (Stamenkovic, et al. 1989), and T cell antigen 0X40 

(Mallett, et al. 1990) . Genetic mutations of both Fas 
and its ligand have been associated with 
lymphoproliferative and autoimmune disorders in mice 
(Watanabe-Fukunaga, et al. 1992; Takahashi, et al . 1994). 

40 
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Furthermore, alterations of Fas expression level have 
been thought to lead to the induction of apoptosis in 
T-cells infected with human immunodeficiency virus (HIV) 
(Westendorp, et al . 1995). 

5 

Several Fas -interacting signal transducing molecules, 
such as Fas-associated phosphatase- 1 (FAP-1) (Figure 1) 
(Sato, et al. 1995) FADD/M0RT1/CAP-1/CAP-2 (Chinnaiyan, 
et al. 1995; Boldin, et al. 1995; Kischkel, et al. 1995) 

10 and RIP (Stanger, et al. 1995), have been identified 

using yeast two-hybrid and biochemical approaches. All 
but FAP-1 associate with the functional cell death domain 
of Fas and overexpression of FADD/M0RT1 or RIP induces 
apoptosis in cells transfected with these proteins. In 

15 contrast, FAP-1 is the only protein that associates with 

the negative regulatory domain (C-terminal 15 amino 
acids) (Ito, et al. 1993) of Fas and that inhibits 
Fas -induced apoptosis. 

20 FAP-1 (PTPN13) has several alternatively- spliced forms 

that are identical to PTP-BAS/hPTPlE/PTPLl , (Maekawa, et 
al. 1994; Banville, et al . 1994; Saras, et al. 1994) and 
contains a membrane -binding region similar to those found 
in the cytoskeleton-associated proteins, ezrin, (Gould et 

25 al. 1989) radixin (Funayama et al. 1991) moesin (Lankes, 

et al. 1991), neurofibromatosis type II gene product 
(NFII) (Rouleau, et al. 1993), and protein 4.1 (Conboy, 
et al. 1991), as well as in the PTPases PTPH1 (Yang, et 
al. 1991), PTP-MEG (Gu, et al . 1991), and PTPD1 (Vogel, 

30 et al. 1993). FAP-1 intriguingly contains six GLGF 

(PDZ/DKR) repeats that are thought to mediate intra -and 
inter-molecular interactions among protein domains. The 
third GLGF repeat of FAP-1 was first identified as a 
domain showing the specific interaction with the 

35 C-terminus of Fas receptor (Sato, et al. 1995). This 

suggests that the GLGF domain may play an important role 
in targeting proteins to the submembranous cytoskeleton 
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and/or in regulating biochemical activity. GLGF repeats 
have been previously found in guanylate kinases, as well 
as in the rat post-synaptic density protein (PSD-95) (Cho, 
et al. 1992), which is a homolog of the Drosophila tumor 
5 suppressor protein, lethal- (1) -disc- large- 1 [dlg-1] 

(Woods, et al 1991; Kitamura, et al. 1994). These 
repeats may mediate homo- and hetero-dimerization, which 
could potentially influence PTPase activity, binding to 
Fas, and/or interactions of FAP-1 with other signal 
10 transduction proteins. Recently, it has also been 

reported that the different PDZ domains of proteins 
interact with the C-terminus of ion channels and other 
proteins (Figure 1) (TABLE 1) (Kornau, et al . 1995; Kim, 
et al. 1995; Matsumine, et al . 1996). 

15 



TABLE 1. Proteins that interact with PDZ domains. 



Protein 


C- terminal 
sequence 


Associated 
protein 


Reference 


Fas (AP0-1/CD95) 


SLV 


FAP-1 


2 


NMDA receptor 
NR2 subunit 


SDV 


PSD95 


3 


Shaker -type K+ 
channel 


TDV 


PSD95 & DLG 


4 


APC 


TEV 


DLG 


5 
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SUMMARY OF THE INVENTION 

This invention provides a composition capable of 
inhibiting specific binding between a signal -transducing 
5 protein and a cytoplasmic protein containing the amino 

acid sequence (G/S/A/E) -L-G- (F/I/L) (Sequence I.D. No.: 
1) . Further, the cytoplasmic protein may contain the 
amino acid sequence (K/R/Q) -X n - (G/S/A/E) -L-G- (F/I/L) 
(Sequence I.D. No.: 2), wherein X represents any amino 

10 acid which is selected from the group comprising the 

twenty naturally occurring amino acids and n represents 
at least 2 , but not more than 4 . In a preferred 
embodiment, the amino acid sequence is SLGI (Sequence 
I.D. No.: 3). Further, the invention provides for a 

15 composition when the signal -transducing protein has at 

its carboxyl terminus the amino acid sequence (S/T) -X- 
(V/I/L) (Sequence I.D. No.: 4) , wherein each - represents 
a peptide bond, each parenthesis encloses amino acids 
which are alternatives to one other, each slash within 

20 such parentheses separating the alternative amino acids, 

and the X represents any amino acid which is selected 
from the group comprising the twenty naturally occurring 
amino acids. 

25 This invention also provides for a method of identifying 

a compound capable of inhibiting specific binding between 
a signal- transducing protein and a cytoplasmic protein 
containing the amino acid sequence (G/S/A/E) -L-G- (F/I/L) . 
Further this invention provides for a method of 

30 identifying a compound capable of inhibiting specific 

binding between a signal -transducing protein having at 
its carboxyl terminus the amino acid sequence (S/T) -X- 
(V/L/I) and a cytoplasmic protein. 

35 This invention also provides for a method inhibiting the 

proliferation of cancer cells, specifically, where the 
cancer cells are derived from organs comprising the 
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colon, liver, breast, ovary, testis, lung, stomach, 
spleen, kidney, prostate, uterus, skin, head, thymus and 
neck, or the cells are derived from either T-cells or B- 
cells . 

5 

This invention also provides for a method of treating 
cancer in a subject in an amount of the composition of 
effective to result in apoptosis of the cells, 
specifically, where the cancer cells are derived from 
10 organs comprising the thymus, colon, liver, breast, 

ovary, testis, lung, stomach, spleen, kidney, prostate, 
uterus, skin, head and neck, or the cells are derived 
from either T-cells or B-cells. 



15 This invention also provides for a method of inhibiting 

the proliferation of virally infected cells, specifically 
wherein the virally infected cells are infected with the 
Hepatitis B virus, Epstein-Barr virus, influenza virus, 
Papilloma virus, Adenovirus, Human T-cell lymphtropic 

20 virus, type 1 or HIV. 



This invention also provides a pharmaceutical composition 
comprising compositions capable of inhibiting specific 
binding between a signal -transducing protein and a 
25 cytoplasmic protein. 

This invention also provides a pharmaceutical composition 
comprising compounds identified to be capable of 
inhibiting specific binding between a signal -transducing 
30 protein and a cytoplasmic protein. 
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BRIEF DESCRIPTION OF THE FIGURES 

Figure 1. Diagram of Fas-associated phosphatase-1 
protein, showing the six GLGF { PDZ/DHR) domain repeats ; 
comparison of similar membrane binding sites with other 
proteins and proteins that contain GLGF (PDZ/DHR) 
repeats . 



Figures 2A, 2B, 2C and 2D. Mapping of the minimal region 
10 of the C- terminal of Fas required for the binding to 

FAP- 1 . Numbers at right show each independent clone 
(Figures 2C and 2D) . 

2A. Strategy for screening of a random peptide library 

by the yeast two-hybrid system. 
15 2B. Alignment of the C- terminal 15 amino acids of Fas 

between human (Sequence I.D. No.: 5), rat (Sequence 

I.D. No.: 6), and mouse (Sequence I.D. No.: 7). 
2C. The results of screening a semi-random peptide 

library. Top row indicates the amino acids which 
20 were fixed based on the homology between human and 

rat. Dash lines show unchanged amino acids. 
2D. The results of screening a random peptide library 

(Sequence I.D. No. 

Sequence I.D. No. 
25 Sequence I.D. No. 

Sequence I.D. No. 

Sequence I.D. No . 

respectively) . 



8, 


Sequence 


I.D. 


No. : 


9, 


10, 


Sequence 


I.D. 


No. : 


11, 


12, 


Sequence 


I.D. 


No. : 


13, 


14, 


Sequence 


I.D. 


No. : 


15, 


16, 


Sequence 


I.D. 


No. : 


17, 



30 Figures 3 A, 3B and 3C. Inhibition assay of Fas/FAP-1 

binding in vitro. 

3A. Inhibition assay of Fas/FAP-1 binding using the 
C-terminal 15 amino acids of Fas. GST-Fas fusion 
protein (191-355) was used for in vitro binding 
35 assay (lane 1, 3-10) . GST-Fas fusion protein 

(191-320) (lane 2) and 1 mM human PAMP (N-terminal 
20 amino acids of proadrenomedullin, M.W. 2460.9) 
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(lane 3) were used as negative controls. The 
concentrations of the C-terminal 15 amino acids 
added were 1 (lane 4), 3 (lane 5), 10 (lane 6), 30 
(lane 7) , 100 (lane 8), 300 (lane 9), and 1000 fiM 
5 (lane 10) . 

3B. Inhibition assay of Fas/FAP-1 binding using the 
truncated peptides corresponding to the C-terminal 
15 amino acids of Fas. All synthetic peptides were 
acetylated for this inhibition assay (Sequence I.D. 
10 No.: 4, Sequence I.D. No.: 18, Sequence I.D. No.: 

19, Sequence I.D. No.: 20, Sequence I.D. No.: 21, 
Sequence I.D. No.: 22, Sequence I.D. No.: 23, 
respectively) . 

3C. Inhibitory effect of Fas/FAP-1 binding using the 
15 scanned tripeptides. 

Figures 4A, 4B, 4C and 4D. 

4A. Interaction of the C-terminal 3 amino acids of Fas 

with FAP-1 in yeast. 
20 4B. Interaction of the C-terminal 3 amino acids of Fas 

with FAP-1 in vitro. 
4C. Immuno-precipitation of native Fas with GST-FAP-1. 
4D. Inhibition of Fas/FAP-1 binding with Ac-SLV or Ac- 

SLY. 

25 

Figures 5A, 5B, 5C, 5D # 5E and 5F. Microinjection of 
Ac-SLV into the DLD-1 cell line. Triangles identify the 
cells both that were could be microinjected with Ac-SLV 
and that showed condensed chromatin identified. On the 
30 other hand, only one cell of the area appeared apoptotic 

when microinjected with Ac -SLY. 

5A. Representative examples of the cells microinjected 
with Ac-SLV in the presence of 500 ng/ml CH11 are 
shown in phase contrast . 
35 5B. Representative examples of the cells microinjected 

with AC-SLY in the presence of 500 ng/ml CH11 are 
shown in phase contrast. 
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5C. Representative examples of the cells microinjected 
with Ac-SLV in the presence of 500 ng/ml CH11 are 
shown stained with FITC. 
5D. Representative examples of the cells microinjected 
5 with AC -SLY in the presence of 500 ng/ml CH11 are 

shown stained with FITC. 
5E. Representative examples of the cells microinjected 
with Ac-SLV in the presence of 500 ng/ml CH11 are 
shown with fluorescent DNA staining with Hoechst 
10 33342. 

5F. Representative examples of the cells microinjected 
with AC-SLY in the presence of 500 ng/ml CH11 are 
shown in fluorescent DNA staining with Hoechst 
33342 . 

15 

Figure 6. Quantitation of apoptosis in microinjected 
DLD-1 cells. 

Figures 7A, 7B, 7C, 7D, 7E, 7F, 7G, and 7H. 

20 7A. Amino acid sequence of human nerve growth factor 

receptor (Sequence I.D. No.: 24). 
7B. Amino acid sequence of human CD4 receptor (Sequence 
I .D. No. 25) . 

7C. The interaction of Fas-associated phosphatase-1 to 
25 the C-terminal of nerve growth factor receptor 

(NGFR) (p75) . 

7D. Amino acid sequence of human colorectal mutant 

cancer protein (Sequence I.D. No.: 26). 
7E. Amino acid sequence of protein kinase C, alpha type. 
30 7F. Amino acid sequence of serotonin 2A receptor 

(Sequence I.D. No.: 27). 
7G. Amino acid sequence of serotonin 2B receptor 

(Sequence I.D. No.: 28). 
7H. Amino acid sequence of adenomatosis polyposis coli 
35 protein (Sequence I.D. No.: 29). 
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Figure 8. Representation of the structural 
characteristics of p75 NGFR (low-affinity nerve growth 
factor receptor) . 

5 Figure 9. Comparison of the C-terminal ends of Fas and 

p75 NGFR. 

Figure 10. In vitro interaction of 35 S-labeled FAP-1 with 
various receptors expressed as GST fusion proteins. The 

10 indicated GST fusion proteins immobilized on glutathione- 

Sepharose beads were incubated with in vitro translated, 
3& S- labeled FAP-1 protein. After the beads were washed, 
retained FAP-1 protein was analyzed by SDS-PAGE and 
autoradiography . 

15 

Figures 11A and 11B. In vitro interaction 35 S-labeled 
FAP-1 with GST-p75 deletion mutants. 

IIA. Schematic representation of the GST fusion 
proteins containing the cytoplasmic domains of 

20 p75 and p75 deletion mutants. Binding of FAP- 

1 to the GST fusion proteins with various p75 
deletion mutants is depicted at the right and 
is based on data from (11B) . 

IIB. Interaction of in vitro translated, 35 S- labeled 
25 FAP-1 protein with various GST fusion proteins 

immobilized on glutathione -Sepharose beads. 
After the beads were washed, retained FAP-1 
protein was analyzed by SDS-PAGE and 
autoradiography . 

30 

Figure 12. The association between LexA- C-terminal 
cytoplasmic region of p75NGFR and VP16-FAP-1. The 
indicated yeast strains were constructed by 
transformation and the growth of colonies was tested. 
35 +/- indicates the growth of colonies on his _ plate. 
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10 



15 



25 



30 



DETAILED DESCRIP TION OP THE INVEMTT mi 

As used herein, amino acid residues are abbreviated as 
follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; 
H, His; I, He; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; 
Q, Gin; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, 
Tyr. 

In order to facilitate an understanding of the material 
which follows, certain frequently occurring methods 
and/or terms are best described in Sambrook, et al., 
1989. 



The present invention provides for a composition capable 
of inhibiting specific binding between a signal - 
transducing protein and a cytoplasmic protein containing 
the amino acid sequence (G/S/A/E) -L-G- (F/I/L) , wherein 
each - represents a peptide bond, each parenthesis 
20 encloses amino acids which are alternatives to one other, 

and each slash within such parentheses separating the 
alternative amino acids. Further, the cytoplasmic 
protein may contain the amino acid sequence (K/R/Q) -x„- 
(G/S/A/E)-L-G- (F/I/L) , wherein X represents any amino acid 
which is selected from the group comprising the twenty 
naturally occurring amino acids and n represents at least 
2, but not more than 4. Specifically, in a preferred 
embodiment, the cytoplasmic protein contains the amino 
acid sequence SLGI . 



The amino acid sequence (K/R/Q) -X n - (G/S/A/E) -L-G- (F/I/L) 
is also well-known in the art as "GLGF (PDZ/DHR) amino 
acid domain." As used herein, "GLGF (PDZ/DHR) amino acid 
domain" means the amino acid sequence (K/R/Q) -x„- 
35 (G/S/A/E) -L-G- (F/I/L) . 

In a preferred embodiment, the signal -transducing protein 
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has at its carboxyl terminus the amino acid sequence 
(S/T) -X- (V/I/L) , wherein each - represents a peptide 
bond, each parenthesis encloses amino acids which are 
alternatives to one other, each slash within such 
5 parentheses separating the alternative amino acids, and 

the X represents any amino acid which is selected from 
the group comprising the twenty naturally occurring amino 
acids . 

10 The compositions of the subject invention may be, but not 

limited to, antibodies, inorganic compounds, organic 
compounds, peptides, peptidomimetic compounds, 
polypeptides or proteins, fragments or derivatives which 
share some or all properties, e.g. fusion proteins. The 

15 composition may be naturally occurring and obtained by 

purification, or may be non-naturally occurring and 
obtained by synthesis. 

Specifically, the composition may be a peptide containing 
20 the sequence (S/T) -X- (V/I/L) -COOH, wherein each 

represents a peptide bond, each parenthesis encloses 
amino acids which are alternatives to one other, each 
slash within such parentheses separating the alternative 
amino acids, the X represents any amino acid which is 
25 selected from the group comprising the twenty naturally 

occurring amino acids. In preferred embodiments, the 
peptide contains one of the following sequences: 
DSENSNFRNE I QSLV , RNEIQSLV, NEIQSLV, EIQSLV, IQSLV, QSLV, 
SLV, I PPDSEDGNEEQSLV , DSEMYNFRSQLAS W , IDLASEFLFLSNSFL, 
30 PPTCSQANSGRISTL, SDSNMNMNELSEV , QNFRTYI VSFV , RETIESTV, 

RGFISSLV, TIQSVI, ESLV. A further preferred embodiment 
would be an organic compound which has the sequence Ac- 
SLV-COOH, wherein the Ac represents an acetyl and each - 
represents a peptide bond. 

35 

An example of the subject invention is provided infra . 
Acetylated peptides may be automatically synthesized on 
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an Advanced ChemTech ACT357 using previously published 
procedures by analogy. Wang resin was used for each run 
and N^-Fmoc protection was used for all amino acids, and 
then 20% piperidine/DMF and coupling was completed using 
5 DIC/HOBt and subsequently HBTU/DIEA. After the last 

amino acid was coupled, the growing peptide on the resin 
was acetylated with Ac 2 0/DMF. The acetylated peptide was 
purified by HPLC and characterized by FAB-MS and 1 H-NMR . 

10 Further, one skilled in the art would know how to 

construct derivatives of the above -described synthetic 
peptides coupled to non-acetyl groups, such as amines. 

This invention also provides for a composition capable of 
15 inhibiting specific binding between a signal-transducing 

protein having at its carboxyl terminus the amino acid 
sequence (S/T) -X- (V/I/L) , wherein each - represents a 
peptide bond, each parenthesis encloses amino acids which 
are alternatives to one other, each slash within such 
20 parentheses separating the alternative amino acids, the 
X represents any amino acid which is selected from the 
group comprising the twenty naturally occurring amino 
acids, and a cytoplasmic protein. 

25 The compositions of the subject invention includes 
ant ibodies , inorganic compounds , organic compounds , 
peptides, peptidomimetic compounds, polypeptides or 
proteins, fragments or derivatives which share some or 
all properties, e.g. fusion proteins. 

30 

This invention also provides a method of identifying a 
compound capable of inhibiting specific binding between 
a signal -transducing protein and a cytoplasmic protein 
containing the amino acid sequence (G/S/A/E) -L-G- (F/I/L) , 
35 wherein each - represents a peptide bond, each 

parenthesis encloses amino acids which are alternatives 
to one other, each slash within such parentheses 
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separating the alternative amino acids, which comprises 
(a) contacting the cytoplasmic protein bound to the 
signal-transducing protein with a plurality of compounds 
under conditions permitting binding between a known 
5 compound previously shown to be able to displace the 

signal -transducing protein bound to the cytoplasmic 
protein and the bound cytoplasmic protein to form a 
complex; and (b) detecting the displaced signal - 
transducing protein or the complex formed in step (a) 
10 wherein the displacement indicates that the compound is 

capable of inhibiting specific binding between the 
signal -transducing protein and the cytoplasmic protein. 

The inhibition of the specific binding between the 
15 signal -transducing protein and the cytoplasmic protein 

may affect the transcription activity of a reporter gene. 

Further, in step (b) , the displaced cytoplasmic protein 
or the complex is detected by comparing the transcription 

20 activity of a reporter gene before and after the 

contacting with the compound in step (a) , where a change 
of the activity indicates that the specific binding 
between the signal -transducing protein and the 
cytoplasmic protein is inhibited and the signal - 

25 transducing protein is displaced. 

As used herein, the * transcript ion activity of a reporter 
gene" means that the expression level of the reporter 
gene will be altered from the level observed when the 

30 signal -transducing protein and the cytoplasmic protein 

are bound. One can also identify the compound by 
detecting other biological functions dependent on the 
binding between the signal -transducing protein and the 
cytoplasmic protein. Examples of reporter genes are 

35 numerous and well-known in the art, including, but not 
limited to, histidine resistant genes, ampicillin 
resistant genes, /3-galactosidase gene. 
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Further the cytoplasmic protein may be bound to a solid 
support. Also the compound may be bound to a solid 
support and comprises an antibody, an inorganic compound, 
an organic compound, a peptide, a peptidomimetic 
5 compound, a polypeptide or a protein. 



An example of the method is provided infra . One can 
identify a compound capable of inhibiting specific 
binding between the signal -transducing protein and the 

10 cytoplasmic protein using direct methods of detection 

such as immuno-precipitation of the cytoplasmic protein 
and the compound bound to a detectable marker. Further, 
one could use indirect methods of detection that would 
detect the increase or decrease in levels of gene 

15 expression. As discussed infra , one could construct 

synthetic peptides fused to a LexA DNA binding domain. 
These constructs would be transformed into the L4 0- strain 
with an appropriate cell line having an appropriate 
reporter gene. One could then detect whether inhibition 

20 had occurred by detecting the levels of expression of the 

reporter gene. In order to detect the expression levels 
of the reporter gene, one skilled in the art could employ 
a variety of well-known methods, e.g. two-hybrid systems 
in yeast, mammals or other cells. 

25 

Further, the contacting of step (a) may be in vitro , in 
vivo, and specifically in an appropriate cell, e.g. yeast 
cell or mammalian cell. Examples of mammalian cells 
include, but not limited to, the mouse fibroblast cell 
30 NIH 3T3, CHO cells, HeLa cells, Ltk" cells, Cos cells, 

etc. 



Other suitable cells include, but are not limited to, 
prokaryotic or eukaryotic cells, e.g. bacterial cells 
35 (including gram positive cells), fungal cells, insect 

cells, and other animals cells. 
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Further, the signal -transducing protein may be a cell 
surface receptor, signal transducer protein, or a tumor 
suppressor protein. Specifically, the cell surface 
protein is the Fas receptor and may be expressed in cells 
5 derived from organs including, but not limited to, 

thymus, liver, kidney, colon, ovary, breast, testis, 
spleen, lung, stomach, prostate, uterus, skin, head, and 
neck, or expressed in cells comprising T-cells and B- 
cells. In a preferred embodiment, the T-cells are Jurkat 
10 T-cells. 

Further, the cell -surface receptor may be a CD4 receptor, 
p75 receptor, serotonin 2 A receptor, or serotonin 2B 
receptor. 

15 

Further, the signal transducer protein may be Protein 
Kinase-C-a-type . 

Further, the tumor suppressor protein may be a 
20 adenomatosis polyposis coli tumor suppressor protein or 

colorectal mutant cancer protein. 

Further, the cytoplasmic protein contains the amino acid 
sequence SLGI, specifically Fas-associated phosphatase- 1 . 

25 

This invention also provides a method of identifying a 
compound capable of inhibiting specific binding between 
a signal -transducing protein having at its carboxyl 
terminus the amino acid sequence (S/T) -X- (V/I/L) , wherein 

30 each - represents a peptide bond, each parenthesis 

encloses amino acids which are alternatives to one other, 
each slash within such parentheses separating the 
alternative amino acids, the X represents any amino acid 
which is selected from the group comprising the twenty 

35 naturally occurring amino acids, and a cytoplasmic 

protein which comprises (a) contacting the signal- 
transducing protein bound to the cytoplasmic protein with 
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a plurality of compounds under conditions permitting 
binding between a known compound previously shown to be 
able to displace the cytoplasmic protein bound to the 
signal -transducing protein and bound signal -transducing 
5 protein to form a complex; and (b) detecting the 
displaced cytoplasmic protein or the complex of step (a) , 
wherein the displacement indicates that the compound is 
capable of inhibiting specific binding between the 
signal -transducing protein and the cytoplasmic protein. 

10 The inhibition of the specific binding between the 

signal -transducing protein and the cytoplasmic protein 
affects the transcription activity of a reporter gene. 
Further, in step (b) , the displaced signal -transducing 
protein or the complex is detected by comparing the 

15 transcription activity of a reporter gene before and 

after the contacting with the compound in step (a) , where 
a change of the activity indicates that the specific 
binding between the signal -transducing protein and the 
cytoplasmic protein is inhibited and the cytoplasmic 

20 protein is displaced. 

Further, in step (b) , the displaced cytoplasmic protein 
or the complex is detected by comparing the transcription 
activity of a reporter gene before and after the 
25 contacting with the compound in step (a) , where a change 

of the activity indicates that the specific binding 
between the signal -transducing protein and the 
cytoplasmic protein is inhibited and the signal - 
transducing protein is displaced. 

30 

As used herein, the "transcription activity of a reporter 
gene" means that the expression level of the reporter 
gene will be altered from the level observed when the 
signal -transducing protein and the cytoplasmic protein 
3 5 are bound. One can also identify the compound by 
detecting other biological functions dependent on the 
binding between the signal -transducing protein and the 



WO 98/05347 PCT/US97/12677 

-17- 

cytoplasmic protein. Examples of reporter genes are 
numerous and well-known in the art, including, but not 
limited to, histidine resistant genes, ampicillin 
resistant genes, j6-galactosidase gene. 

5 

Further, the cytoplasmic protein may be bound to a solid 
support or the compound may be bound to a solid support, 
comprises an antibody, an inorganic compound, an organic 
compound, a peptide, a peptidomimetic compound, a 
10 polypeptide or a protein. 



An example of the method is provided infra. One could 
identify a compound capable of inhibiting specific 
binding between the signal -transducing protein and the 
15 cytoplasmic protein using direct Methods of detection 

such as immuno-precipitation of the cytoplasmic protein 
and the compound bound with a detectable marker. 
Further, one could use indirect methods of detection that 
would detect the increase or decrease in levels of gene 
20 expression. As discussed infra , one could construct 

synthetic peptides fused to a LexA DNA binding domain. 
These constructs would be transformed into L40-strain 
with an appropriate cell line having a reporter gene. 
One could then detect whether inhibition had occurred by 
25 detecting the levels of the reporter gene. Different 

methods are also well known in the art, such as employing 
a yeast two-hybrid system to detect the expression of a 
reporter gene. 

Further the contacting of step (a) can be in vitro or in 
vivo , specifically in a yeast cell or a mammalian cell. 
Examples of mammalian cells include, but not limited to, 
the mouse fibroblast cell NIH 3T3, CHO cells, HeLa cells, 
Ltk" cells, Cos cells, etc. 

Other suitable cells include, but are not limited to, 
prokaryotic or eukaryotic cells, e.g. bacterial cells 



30 



35 
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( including gram positive cells), fungal cells, insect 
cells, and other animals cells. 



Further, the signal -transducing protein is a cell surface 
5 receptor, signal transducer protein, or a tumor 
suppressor protein. Specifically, the cell surface 
protein is the Fas receptor and is expressed in cells 
derived from organs comprising thymus, liver, kidney, 
colon, ovary, breast, testis, spleen, stomach, prostate, 
10 uterus, skin, head and neck, or expressed in cells 

comprising T-cells and B-cells. in a preferred 
embodiment, the T-cells are Jurkat T-cells. 



Further, the cell -surface receptor may be a CD4 receptor, 
15 p75 receptor, serotonin 2A receptor, or serotonin 2B 

receptor* 

Further, the signal transducer protein may be Protein 
Kinase- C- a- type . 

20 

Further, the tumor suppressor protein may be a 
adenomatosis polyposis coli tumor suppressor protein or 
colorectal mutant cancer protein. 

25 Further, the cytoplasmic protein contains the amino acid 

sequence SLGI, specifically Fas -associated phosphatase - 
1 . 



This invention also provides a method of inhibiting the 
30 proliferation of cancer cells comprising the above- 

described composition, specifically, wherein the cancer 
cells are derived from organs including, but not limited 
to, thymus, liver, kidney, colon, ovary, breast, testis, 
spleen, stomach, prostate, uterus, skin, head and neck, 
35 or wherein the cancer cells are derived from cells 

comprising T-cells and B-cells. 
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This invention also provides a method of inhibiting the 
proliferation of cancer cells comprising the compound 
identified by the above -described method, wherein the 
cancer cells are derived from organs including, but not 
5 limited to, thymus, liver, kidney, colon, ovary, breast, 

testis, spleen, stomach, prostate, uterus, skin, head and 
neck, or wherein the cancer cells are derived from cells 
comprising T-cells and B-cells. 

10 The invention also provides a method of treating cancer 

in a subject which comprises introducing to the subject's 
cancerous cells an amount of the above-described 
composition effective to result in apoptosis of the 
cells, wherein the cancer cells are derived from organs 

15 including, but not limited to, thymus, liver, kidney, 

colon, ovary, breast, testis, spleen, stomach, prostate, 
uterus, skin, head and neck, or wherein the cancer cells 
are derived from cells comprising T-cells and B-cells. 

20 As used herein "apoptosis" means programmed cell death of 
the cell. The mechanisms and effects of programmed cell 
death differs from cell lysis. Some observable effects 
of apoptosis are: DNA fragmentation and disintegration 
into small membrane -bound fragments called apoptotic 

25 bodies. 

Means of detecting whether the composition has been 
effective to result in apoptosis of the cells are well- 
known in the art. One means is by assessing the 
30 morphological change of chromatin using either phase 

contrast or fluorescence microscopy. 

The invention also provides for a method of inhibiting 
the proliferation of virally infected cells comprising 
35 the above -described composition or the compound 

identified by the above-described, wherein the virally 
infected cells comprise Hepatitis B virus, Epstein-Barr 
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virus, influenza virus, Papilloma virus, Adeno virus, 
Human T-cell lymphtropic virus, type 1 or HIV. 



The invention also provides a method of treating a 
5 virally- infected subject which comprises introducing to 
the subject's virally- infected cells the above-described 
composition effective to result in apoptosis of the cells 
or the compound identified by the above-described method 
of claim 27 effective to result in apoptosis of the 
10 cells, wherein the virally infected cells comprise the 

Hepatitis B virus, Epstein-Barr virus, influenza virus, 
Papilloma virus, Adeno virus, Human T-cell lymphtropic 
virus, type 1 or HIV. 

15 Means of detecting whether the composition has been 
effective to result in apoptosis of the cells are well- 
known in the art. One means is by assessing the 
morphological change of chromatin using either phase 
contrast or fluorescence microscopy. 

20 

This invention also provides for a pharmaceutical 
composition comprising the above -described composition of 
in an effective amount and a pharmaceutically acceptable 
carrier. 

25 

This invention also provides for a pharmaceutical 
composition comprising the compound identified by the 
above -described method of in an effective amount and a 
pharmaceutically acceptable carrier. 

30 

This invention further provides a composition capable of 
specifically binding a signal-transducing protein having 
at its carboxyl terminus the amino acid sequence (S/T) -X- 
(V/Ii/I), wherein each - represents a peptide bond, each 
35 parenthesis encloses amino acids which are alternatives 
to one other, each slash within such parentheses 
separating the alternative amino acids, and the X 
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represents any amino acid which is selected from the 
group comprising the twenty naturally occurring amino 
acids. The composition may contain the amino acid 
sequence (G/S/A/E) -L-G- (F/I/L) , wherein each - represents 
5 a peptide bond, each parenthesis encloses amino acids 

which are alternatives to one other, and each slash 
within such parentheses separating the alternative amino 
acids. In a preferred embodiment, the composition 
contains the amino acid sequence (K/R/Q) -Xj,- (G/S/A/E) -L-G- 
10 (F/I/L) . wherein X represents any amino acid which is 

selected from the group comprising the twenty naturally 
occurring amino acids and n represents at least 2, but 
not more than 4. In another preferred embodiment, the 
composition contains the amino acid sequence SLGI . 

15 

This invention further provides a method for identifying 
compounds capable of binding to a signal -transducing 
protein having at its carboxyl terminus the amino acid 
sequence (S/T) -X- (V/L/I) , wherein each - represents a 

20 peptide bond, each parenthesis encloses amino acids which 
are alternatives to one other, each slash within such 
parentheses separating the alternative amino acids, the 
X represents any amino acid which is selected from the 
group comprising the twenty naturally occurring amino 

25 acids, which comprises (a) contacting the signal- 

transducing protein with a plurality of compounds under 
conditions permitting binding between a known compound 
previously shown to be able to bind to the signal - 
transducing protein to form a complex; and (b) detecting 

30 the complex formed in step (a) so as to identify a 

compound capable of binding to the signal - transducing 
protein. Specifically, the identified compound contains 
the amino acid sequence (G/S/A/E) -L-G- (F/I/L) , In a 
further preferred embodiment, the identified compound 

35 contains the amino acid sequence SLGI. 

Further, in the above -de scribed method, the signal- 
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transducing protein may be bound to a solid support. 
Also, the compound may be bound to a solid support, and 
may comprise an antibody, an inorganic compound, an 
organic compound, a peptide, a peptidomimetic compound, 
5 a polypeptide or a protein. 

Further, the signal -transducing protein may be a cell- 
surface receptor or a signal transducer. Specifically, 
the signal -transducing protein may be the Fas receptor, 
10 CD4 receptor, p75 receptor, serotonin 2A receptor, 

serotonin 2B receptor, or protein kinase-C-or-type . 

This invention also provides a method of restoring 
negative regulation of apoptosis in a cell comprising the 
15 above-described composition or a compound identified by 

the above -described method. 

As used herein "restoring negative regulation of 
apoptosis" means enabling the cell from proceeding onto 
20 programmed cell death. 

For example, cells that have functional Fas receptors and 
Fas -associated phosphatase 1 do not proceed onto 
programmed cell death or apoptosis due to the negative 

25 regulation of Fas by the phosphatase. However, if Fas- 

associated phosphatase 1 is unable to bind to the 
carboxyl terminus of the Fas receptor ( (S/T) -X- (V/L/I ) 
region) , e.g. mutation or deletion of at least one of 
the amino acids in the amino acid sequence (G/S/A/E) -L-G- 

30 (F/I/L) , the cell will proceed to apoptosis. By 

introducing a compound capable of binding to the carboxyl 
terminus of the Fas receptor, one could mimic the effects 
of a functional phosphatase and thus restore the negative 
regulation of apoptosis. 

35 

This invention also provides a method of preventing 
apoptosis in a cell comprising the above-described 
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composition or a compound identified by the above - 
described method. 



This invention also provides a means of treating 
5 pathogenic conditions caused by apoptosis of relevant 
cells comprising the above -described composition or the 
compound identified by the above- described method. 

This invention is illustrated in the Experimental Details 
10 section which follows. These sections are set forth to 

aid in an understanding of the invention but are not 
intended to, and should not be construed to, limit in any 
way the invention as set forth in the claims which follow 
thereafter. 
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Experimental Details 
5 Methods and Materials 

1. Screening a semi-random and random peptide library. 

To create numerous mutations in a restricted DNA 
10 sequence, PCR mutagenesis with degenerate 

oligonucleotides was employed according to a protocol 
described elsewhere (Hill, et al . 1987). Based on the 
homology between human and rat, two palindromic sequences 
were designed for construction of semi-random library. 
15 The two primers used were 

5' - CGGAATTCNNNNNNNNNAACAGCNNNN^ 

NTG AGGATCC TCA-3 ' (Seq. I.D. No.: 30) and 

5' - CGGAATTCGACTCAGAANNNNNNAACTTC^ 

CTGAGGATCCTCA- 3 ' (Seq. I.D. No.: 31). Briefly, the two 

20 primers (each 200 pmol) , purified by HPLC, were annealed 

at 70 °C for 5 minutes and cooled at 23 °C for 60 minutes. 
A Klenow fragment (5 U) was used for filling in with a 
dNTP mix (final concentration, 1 mM per each dNTP) at 
23°C for 60 minutes. The reaction was stopped with 1 ^1 

25 of 0.5 M EDTA and the DNA was purified with ethanol 

precipitation. The resulting double- stranded DNA was 
digested with EcoRI and BamHI and re -purified by 
electrophoresis on non- denaturing polyacrylamide gels. 
The double-strand oligonucleotides were then ligated into 

3 0 the EcoRI -BamHI sites of the pBTM116 plasmid. The 

ligation mixtures were electroporated into the E. coli 
XLl-Blue MRF' (Stratagene) for the plasmid library. The 
large scale transformation was carried out as previously 
reported. The plasmid library was transformed into 

35 L40-strain cells (MATa, trpl, leu2, his3, ade2, 

LYS2: (lexAop) 4 -HIS3, URA3: : (lexAopf -lacZ) carrying the 
plasmid pVP16-31 containing a FAP-1 cDNA (Sato, et al . 



i 
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1995) . Clones that formed on histidine -deficient medium 
(His*) were transferred to plates containing 40 /xg/ml 
X-gal to test for a blue reaction product (S-gal + ) in 
plate and filter assays. The clones selected by His* and 
5 fc-gal* assay were tested for further analysis. The 

palindromic oligonucleotide, 
5 ' - CGGAATTC- (NNN) . , c - TG AGGATCC TCA- 3 ' (Seq. I.D. No. 32), 
was used for the construction of the random peptide 
library. 

10 

2 . Synthesis of peptides 

Peptides were automatically synthesized on an Advanced 
ChemTech ACT357 by analogy to published procedures 

15 (Schnorrenberg and Gerhardt, 1989). Wang resin (0.2-0.3 

mmole scale) was used for each run and N°-Fmoc protection 
was employed for all amino acids. Deprotection was 
achieved by treatment with 20% piperidine/DMF and 
coupling was completed using DIC/HOBt and subsequent 

20 HBTU/DIEA. After the last amino acid was coupled, the 

growing peptide on the resin was acetylated with Ac 2 0/DMF. 
The peptide was cleaved from the resin with concomitant 
removal of all protecting groups by treating with TFA. 
The acetylated peptide was purified by HPLC and 

25 characterized by FAB-MS and 'H-NMR. 

3. Inhibition asssay of Fas/FAP-l binding using the C- 
terminal 15 amino acids of Fas. 

30 HFAP-10 cDNA (Sato # et al . 1995) subcloned into the 

Bluescript vector pSK-II (Stratagene) was in 
vitro-translated from an internal methionine codon in the 
presence of 35 S-L-methionine using a- coupled in vitro 
transcription/translation system (Promega, TNT lysate) 

35 and T7 RNA polymerase. The resulting 35 S-labeled protein 

was incubated with GST-Fas fusion proteins that had been 
immobilized on GST-Sepharose 4B affinity beads 
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(Pharmacia) in a buffer containing 150 mM NaCl, 50 mM 
Tris [pH 8.0], 5 mM DTT, 2 mM EDTA, 0.1 % NP-40, 1 mM 
PMSF, 50 /ig/ml leupeptin, 1 mM Benzamidine , and 7 /xg/ml 
pepstatin for 16 hours at 4 *C. After washing vigorously 
5 4 times in the same buffer, associated proteins were 

recovered with the glutathione -Sepharose beads by 
centrif ugation, eluted into boiling Laemmli buffer, and 
analyzed by SDS-PAGE and f luorography . 



10 4. Inhibition assay of terminal 15 amino acids of Fas 
and inhibitory effect of Fas/FAP-1 binding using 
diverse tripeptides . 



In vitro- translated [ 35 S]HFAP-1 was purified with a NAP- 5 
15 column (Pharmacia) and incubated with 3 jiM of GST- fusion 

proteins for 16 hours at 4*C. After washing 4 times in 
the binding buffer, radioactivity incorporation was 
determined in a b counter. The percentage of binding 
inhibition was calculated as follows: percent inhibition 
20 = [radioactivity incorporation using GST-Fas (191-335) 

with peptides - radioactivity incorporation using GST- Fas 
(191-320) with peptides] / [radioactivity incorporation 
using GST-Fas (191-335) without peptides - radioactivity 
incorporation using GST-Fas (191-320) without peptides] . 
25 n=3 . 

5. Interaction of the C-terminal 3 amino acids of Fas 
with FAP-1 in yeast and in vitro . 

30 The bait plasmids, pBTM116 (LexA) -SLV, -PLV, -SLY, and 

-SLA, were constructed and transformed into L40-strain 
with pVP16-FAP-l or -ras. Six independent clones from 
each transf ormants were picked up for the analysis of 
growth on histidine-def icient medium. GST-Fas, -SLV, and 

3 5 PLV were purified with GST-Sepharose 4B affinity beads 

(Pharmacia) . The methods for in vitro binding are 
described above. 



WO 98/05347 PCT/US97/12677 

-27- 

6. Immuno-precipitation of native Fas with GST-FAP-1 
and inhibition of Fas/FAP-1 binding with Ac-SLV. 



GST- fusion proteins with or without FAP-1 were incubated 
5 with cell extracts from Jurkat T-cells expressing Fas. 

The bound Fas was detected by Western analysis using 
anti-Fas monoclonal antibody (F22120, Transduction 
Laboratories) . The tripeptides, Ac-SLV and Ac-SLY were 
used for the inhibition assay of Fas/FAP-1 binding. 

10 

7. Microinjection of Ac-SLV into the DLD-1 cell line. 
DLD-1 human colon cancer cells were cultured in RPMI 164 0 
medium containing 10% FCS. For microinjection, cells 
were plated on CELLocate (Eppendorf ) at 1 X 10 5 cells/2 ml 

15 in a 35 mm plastic culture dish and grown for 1 day. Just 

before microinjection, Fas monoclonal antibodies CH11 
(MBL International) was added at the concentration of 500 
ng/ml. All microinjection experiments were performed 
using an automatic microinjection system (Eppendorf 

20 transjector 5246, micro-manipulator 5171 and Femtotips) 

(Pantel, et al . 1995). Synthetic tripeptides were 
suspended in 0.1% (w/v) FITC-Dextran (Sigma) /K-PBS at the 
concentration of 100 mM. The samples were microinjected 
into the cytoplasmic region of DLD-1 cells. Sixteen to 

25 2 0 hours post inject ion, the cells were washed with PBS 

and stained with 10 fig/ml Hoechst 33342 in PBS. After 
incubation at 37°C for 30 minutes, the cells were 
photographed and the cells showing condensed chromatin 
were counted as apoptotic. 

30 

8. Quantitation of apoptosis in microinjected DLD-1 
cells . 



For each experiment, 25-100 cells were microinjected. 
35 Apoptosis of microinjected cells was determined by 

assessing morphological changes of chromatin using phase 
contrast and fluorescence microscopy (Wang, et al., 1995; 
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McGahon, et al w 1995). The data are means + /- S.D. for 
two or three independent determinations. 



Discussion 

5 

In order to identify the minimal peptide stretch in the 
C- terminal region of the Fas receptor necessary for FAP-1 
binding, an in vitro inhibition assay of Fas/FAP-1 
binding was used using a series of synthetic peptides as 

10 well as yeast two-hybrid system peptide libraries (Figure 

2A) . First, semi-random libraries (based on the homology 
between human and rat Fas) (Figures 2B and 2C) of 15 
amino acids fused to a LexA DNA binding domain were 
constructed and co- trans formed into yeast strain L4 0 with 

15 pVP16-31 (Sato, et al. 1995) that was originally isolated 

as FAP-1. After the selection of 200 His* colonies from an 
initial screen of 5.0 X 10 6 (Johnson, et al. 1986) 
transf ormants, 100 colonies that were /?-galactosidase 
positive were picked for further analysis. Sequence 

20 analysis of the library plasmids encoding the C- terminal 

15 amino acids revealed that all of the C- termini were 
either valine, leucine or isoleucine residues. Second, 
a random library of 4-15 amino acids fused to a LexA DNA 
binding domain was constructed and screened according to 

25 this strategy (Figure 2D) . Surprisingly, all of the third 

amino acid residues from the C- termini were serine, and 
the results of C-terminal amino acid analyses were 
identical to the screening of the semi -random cDNA 
libraries. No other significant amino acid sequences were 

30 found in these library screenings, suggesting that the 

motifs of the last three amino acids (tS-X-V/L/I) are 
very important for the association with the third PD2 
domain of FAP-1 and play a crucial role in 
protein-protein interaction as well as for the regulation 

35 of Fas-induced apoptosis. To further confirm whether the 

last three amino acids are necessary and sufficient for 
Fas/FAP-1 binding, plasmids of the LexA-SLV, -PLV, -PLY, 
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-SLY, and -SLA fusion proteins were constructed and 
co-transformed into yeast with pVP16-FAP-l. The results 
showed that only LexA-SLV associated with FAP-l, whereas 
LexA-PLV, -PLY, -SLY, and -SLA did not (Figure 4A) . In 
5 vitro binding studies using various GST-tripeptide 
fusions and in vi tro-translated FAP-l were consistent 
with these results (Figure 4B) . 



In addition to yeast two-hybrid approaches, in vitro 

10 inhibition assay of Fas/FAP-1 binding was also used. 

First, a synthetic peptide of the C-terminal 15 amino 
acids was tested whether it could inhibit the binding of 
Fas and FAP-l in vitro (Figure 3 A) . The binding of in 
vitro- translated FAP-l to GST- Fas was dramatically 

15 reduced and dependent on the concentration of the 

synthetic 15 amino acids of Fas. In contrast with these 
results, human PAMP peptide (Kitamura, et al. 1994) as a * 
negative control had no effect on Fas/FAP-1 binding 
activity under the same biochemical conditions. Second, 

20 the effect of truncated C-terminal synthetic peptides of 

Fas on Fas/FAP-1 binding in vitro was examined. As shown 
in Figure 3B, only the three C-terminal amino acids 
(Ac-SLV) were sufficient to obtain the same level of 
inhibitory effect on the binding of FAP-l to Fas as 

25 achieved with the 4-15 synthetic peptides. Furthermore, 
Fas/FAP-1 binding was extensively investigated using the 
scanned tripeptides to determine the critical amino acids 
residues required for inhibition (Figure 3C) . The 
results revealed that the third amino acids residues from 

30 the C- terminus, and the C-terminal amino acids having the 

strongest inhibitory effect were either serine or 
threonine; and either valine, leucine, or isoleucine, 
respectively. However, there were no differences among 
the second amino acid residues from the C- terminus with 

35 respect to their inhibitory effect on Fas/FAP-1 binding. 

These results were consistent with those of the yeast 
two-hybrid system (Figures 2C and 2D) . Therefore, it was 
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concluded that the C- terminal three amino acids (SLV) are 
critical determinants of Fas binding to the third PDZ 
domain of FAP-1 protein. 

5 To further substantiate that the PDZ domain interacts 

with fcS/T-X-V/L/I under more native conditions, GST-fused 
FAP-1 proteins were tested for their ability to interact 
with Fas expressed in Jurkat T-cells. The results 
revealed that the tripeptide Ac-SLV, but not Ac-SLY, 

10 abolished in a dose -dependent manner the binding activity 

of FAP-1 to Fas proteins extracted from Jurkat T-cells 
(Figures 4C and 4D) . This suggests that the C- terminal 
amino acids tSLV are the minimum binding site for FAP-1, 
and that the amino acids serine and valine are critical 

15 for this physical association. 

To next examine the hypothesis that the physiological 
association between the C-terminal three amino acids of 
Fas and the third PDZ domain of FAP-1 is necessary for 

20 the in vivo function of FAP-1 as a negative regulator of 

Fas-mediated signal transduction, a microinjection 
experiment was employed with synthetic tripeptides in a 
colon cancer cell line, DLD-1, which expresses both Fas 
and FAP-1, and is resistant to Fas-induced apoptosis. 

25 The experiments involved the direct microinjection of the 

synthetic tripeptides into the cytoplasmic regions of 
single cells and the monitoring of the physiological 
response to Fas- induced apoptosis in vivo. The results 
showed that microinjection of Ac-SLV into DLD-1 cells 

30 dramatically induced apoptosis in the presence of 

Fas -monoclonal antibodies (CH11, 500 ng/ml) (Figures 5A, 
5E and Figure 6), but that microinjection of Ac-SLY and 
PBS/K did not (Figures 5B, 5F and Figure 6) . These 
results strongly support the hypothesis that the physical 

35 association of FAP-1 with the C- terminus of Fas is 

essential for protecting cells from Fas -induced 
apoptosis. 
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In summary, it was found that the C-terminal SLV of Fas 
is alone necessary and sufficient for binding to the 
third PD2 domain of FAP-1. Secondly, it is proposed that 
the new consensus motif of tS/T-X-V/L/I for such binding 
5 to the PDZ domain, instead of tS/T-X-V. It is therefore 

possible that FAP-1 plays important roles for the 
modulation of signal transduction pathways in addition to 
its physical interaction with Fas. Thirdly, it is 
demonstrated that the targeted induction of Fas-mediated 

10 apoptosis in colon cancer cells by direct microinjection 

of the tripeptide Ac-SLV. Further investigations 
including the identification of a substrate (s) of FAP-1 
and structure -function analysis will provide insight to 
the potential therapeutic applications of Fas/FAP-1 

15 interaction in cancer as well as provide a better 

understanding of the inhibitory effect of FAP-1 on 
Fas-mediated signal transduction. 
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SECOND SERIES OF EXPERIMENTS 

FAP-1 was originally identified as a membrane-associated 
protein tyrosine phosphatase which binds to the C- 
5 terminus of Fas, and possesses six PDZ domains (also 

known as DHR domain or GLGF repeat) . PDZ domain has 
recently been shown as a novel module for specific 
protein-protein interaction, and it appears to be 
important in the assembly of membrane proteins and also 

10 in linking signaling molecules in a multiprotein complex. 

In recent comprehensive studies, it was found that the 
third PDZ domain of FAP-1 specifically recognized the 
sequence motif t(S/T)-X-V and interacts with the C- 
terminal three amino acids SLV of Fas (Fig. 9) . In order 

15 to investigate the possibility that FAP-1 also interacts 

with the C-terminal region of p75NGFR (Fig. 8) , an in 
vitro binding assay, was performed as well as, a yeast 
two-hybrid analysis by using a series of deletion mutants 
of p75NGFR. The results revealed that the C-terminal 

20 cytoplasmic region of p75NGFR, which is highly conserved 

among all species, interacts with FAP-1 (Fig. 10) . 
Furthermore, the C-terminal three amino acids SPV of 
p75NGFR were necessary and sufficient for the interaction 
with the third PDZ domain of FAP-1 (Fig. 11A and 11B) . 

25 Since FAP-1 expression was found highest in fetal brain, 

these findings imply that interaction of FAP-1 with 
p75NGFR plays an important role for signal transduction 
pathway via p75NGFR in neuronal cells as well as in the 
formation of the initial signal-transducing complex for 

30 p75NGFR. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: Takaaki Sato and Junn Yanagisawa 

<ii) TITLE OF INVENTION: COMPOUNDS THAT INHIBIT THE 

INTERACTION BETWEEN SIGN AL- 
IO TRANSDUCING PROTEINS AND THE GLGF 

(PDZ/DHR) DOMAIN AND USES THEREOF 

(iii) NUMBER OF SEQUENCES: 33 

15 <iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Cooper & Dunham LLP 

(B) STREET: 1185 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 
20 (E) COUNTRY: U.S.A. 

(F) ZIP: 10036 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
25 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.30 

<vi) CURRENT APPLICATION DATA: 
30 (A) APPLICATION NUMBER: Not Yet Known 

(B) FILING DATE: 18- JUL- 1997 

(C) CLASSIFICATION: 

(Viii) ATTORNEY/AGENT INFORMATION: 
35 (A) NAME: White, John P 

(B) REGISTRATION NUMBER: 28,678 

(C) REFERENCE / DOCKET NUMBER: 0575/48962-A-PCT/ JPW/JKM: 

(ix) TELECOMMUNICATION INFORMATION: 
40 (A) TELEPHONE: (212) 278-0400 

(B) TELEFAX: (212) 391-0525 

(2) INFORMATION FOR SEQ ID NO:l: 

45 ii) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

50 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

55 (iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Gly/Ser/Ala/Glu Leu Gly Phe/Ile/Leu 
60 1 



(2) INFORMATION FOR SEQ ID NO: 2: 

65 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

5 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Lys/Arg/Gln Xaa(n) Gly/Ser/Ala/Glu Leu Gly Phe/Ile/Leu 
1 5 



15 



25 



35 



45 



55 



65 



(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
20 (B> TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Ser Leu Gly lie 
1 



(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 6 amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Ser/Thr Xaa Val/Ile/Leu 
l 



(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 
60 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
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Asp Ser Glu Asn Ser Asn Phe Arg Asn Glu lie Gin Ser Leu Val 
15 10 15 

5 (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

15 {xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ser lie Ser Asn Ser Arg Asn Glu Asn Glu Gly Gin Ser Leu Glu 
15 10 15 



20 



30 



35 



€0 



(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Ser Thr Pro Asp Thr Gly Asn Glu Asn Glu Gly Gin Cys Leu Glu 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Glu Ser Leu Val 
50 1 



(2) INFORMATION FOR SEQ ID NO: 9: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



65 Thr lie Gin Ser Val He 

1 5 
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(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



15 



50 



60 



(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Arg Gly Phe lie Ser Ser Leu Val 
1 5 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Arg Glu Thr lie Glu Ser Thr Val 
30 l 5 

(2) INFORMATION FOR SEQ ID NO: 12: 

35 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

4 5 Gin Asn Phe Arg Thr Tyr He Val Ser Phe Val 

15 10 



(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xil SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Ser Asp Ser Asn Met Asn Met Asn Glu Leu Ser Glu Val 
15 10 



65 (2) INFORMATION FOR SEQ ID NO: 14: 

<i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

5 

{ii> MOLECULE TYPE: peptide 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

10 Pro Pro Thr Cys Ser Gin Ala Asn Ser Gly Arg lie Ser Thr Leu 

15 10 15 



15 



25 



(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



He Asp Leu Ala Ser Glu Phe Leu Phe Leu Ser Asn Ser Phe Leu 
15 10 15 



30 (2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

4 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Asp Ser Glu Met Tyr Asn Phe Arg Ser Gin Leu Ala Ser Val Val 
15 10 15 

45 (2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

50 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

55 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

He Pro Pro Asp Ser Glu Asp Gly Asn Glu Glu Gin Ser Leu Val 
15 10 15 

60 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
65 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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20 



55 



65 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 : 

Gin Ser Leu Val 
1 

(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

lie Gin Ser Leu Val 
1 5 



25 (2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

30 <C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Glu He Gin Ser Leu Val 
1 5 

40 (2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Asn Glu He Gin Ser Leu Val 
1 5 



(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
60 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
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Arg Asn Glu lie Gin Ser Leu Val 
1 5 

5 (2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

15 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Asp Ser Glu Asn Ser Asn Phe Arg Asn Glu lie Gin Ser Leu Val 
15 10 15 



20 



30 



35 



50 



65 



(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Met Gly Ala Gly Ala Thr Gly Arg Ala Met Asp Gly Pro Arg Leu Leu 
15 10 15 

Leu Leu Leu Leu Leu Gly Val Ser Leu Gly Gly Ala Lys Glu Ala Cys 
20 25 30 



Pro Thr Gly Leu Tyr Thr His Ser Gly Glu Cys Cys Lys Ala Cys Asn 
40 35 40 45 

Leu Gly Glu Gly Val Ala Gin Pro Cys Gly Ala Asn Gin Thr Val Cys 
50 55 60 

45 Glu Pro Cys Leu Asp Ser Val Thr Phe Ser Asp Val Val Ser Ala Thr 

65 70 75 80 



Glu Pro Cys Lys Pro Cys Thr Glu Cys Val Gly Leu Gin Ser Met Ser 
85 90 95 

Ala Pro Cys Val Glu Ala Asp Asp Ala Val Cys Arg Cys Ala Tyr Gly 
100 105 110 



Tyr Tyr Gin Asp Glu Thr Thr Gly Arg Cys Glu Ala Cys Arg Val Cys 
55 115 120 125 

Glu Ala Gly Ser Gly Leu Val Phe Ser Cys Gin Asp Lys Gin Asn Thr 
130 135 140 

60 Val Cys Glu Glu Cys Pro Asp Gly Thr Tyr Ser Asp Glu Ala Asn His 

145 150 155 160 



Val Asp Pro Cys Leu Pro Cys Thr Val Cys Glu Asp Thr Glu Arg Gin 
165 170 175 

Leu Arg Glu Cys Thr Arg Trp Ala Asp Ala Glu Cys Glu Glu lie Pro 

180 185 190 
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Gly Arg Trp II Thr Arg Ser Thr Pro Pro Glu Gly Ser Asp Ser Thr 
195 200 205 

Ala Pro Ser Thr Gin Glu Pro Glu Ala Pro Pro Glu Gin Asp Leu lie 
5 210 215 220 

Ala Ser Thr Val Ala Gly Val Val Thr Thr Val Met Gly Ser Ser Gin 
225 230 235 240 

10 Pro Val Val Thr Arg Gly Thr Thr Asp Asn Leu He Pro Val Tyr Cys 

245 250 255 



15 



30 



45 



Ser He Leu Ala Ala Val Val Val Gly Leu Val Ala Tyr He Ala Phe 
260 265 270 

Lys Arg Trp Asn Ser Cys Lys Gin Asn Lys Gly Gly Ala Asn Ser Arg 
275 280 265 



Pro Val Asn Gin Thr Pro Pro Pro Glu Gly Glu Lys He His Ser Asp 
20 290 295 300 

Ser Gly He Ser Val Asp Ser Gin Ser Leu His Asp Gin Gin Pro His 
305 310 315 320 

25 Thr Gin Thr Ala Ser Gly Gin Ala Leu Lys Gly Asp Gly Gly Leu Tyr 

325 330 335 



Ser Ser Leu Pro Pro Ala Lys Arg Glu Glu Val Glu Lys Leu Leu Asn 
340 345 350 

Gly Ser Ala Gly Asp Thr Trp Arg His Leu Ala Gly Glu Leu Gly Tyr 
355 360 365 



Gin Pro Glu His He Asp Ser Phe Thr His Glu Ala Cys Pro Val Arg 

35 370 375 380 

Ala Leu Leu Ala Ser Trp Ala Thr Gin Asp Ser Ala Thr Leu Asp Ala 

385 390 395 400 

40 Leu Leu Ala Ala Leu Arg Arg He Gin Arg Ala Asp Leu Val Glu Ser 

405 410 415 



Leu Cys Ser Glu Ser Thr Ala Thr Ser Pro Val 
420 425 



(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 458 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 
60 1 5 10 15 

Ala Leu Leu Pro Ala Ala Thr Gin Gly Lys Lys Val Val Leu Gly Lys 
20 25 30 

65 Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 

35 40 45 
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lie Gin Phe His Trp Lys Asn Ser Asn Gin lie Lys lie Leu Gly Asn 
50 55 60 

Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Ala 
5 65 70 75 80 

Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe Pro Leu He He 
85 90 95 

10 Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Glu 

100 105 110 



15 



30 



45 



60 



Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

Ser Asp Thr His Leu Leu Gin Gly Gin Ser Leu Thr He Thr Leu Glu 
130 135 140 



Ser Pro Pro Gly Ser Ser Pro Ser Val Gin Cys Arg Ser Pro Arg Gly 
20 145 150 155 160 

Lys Asn He Gin Gly Gly Lys Thr Leu Ser Val Ser Gin Leu Glu Leu 
165 170 175 

25 Gin Asp Ser Gly Thr Trp Thr Cys Thr Val Leu Gin Asn Gin Lys Lys 

180 185 190 



Val Glu Phe Lys He Asp He Val Val Leu Ala Phe Gin Lys Ala Ser 
195 200 205 

Ser lie Val Tyr Lys Lys Glu Gly Glu Gin Val Glu Phe Ser Phe Pro 
210 215 220 



Leu Ala Phe Thr Val Glu Lys Leu Thr Gly Ser Gly Glu Leu Trp Trp 
35 225 230 235 240 

Gin Ala Glu Arg Ala Ser Ser Ser Lys Ser Trp He Thr Phe Asp Leu 
245 250 255 

40 Lys Asn Lys Glu Val Ser Val Lys Arg Val Thr Gin Asp Pro Lys Leu 

260 265 270 



Gin Met Gly Lys Lys Leu Pro Leu His Leu Thr Leu Pro Gin Ala Leu 
275 280 285 

Pro Gin Tyr Ala Gly Ser Gly Asn Leu Thr Leu Ala Leu Glu Ala Lys 
290 295 300 



Thr Gly Lys Leu His Gin Glu Asn Val Leu Val Val Met Arg Ala Thr 
50 305 310 315 320 

Gin Leu Gin Lys Asn Leu Thr Cys Glu Val Trp Gly Pro Thr Ser Pro 
325 330 335 

55 Lys Leu Met Leu Ser Leu Lys Leu Glu Asn Lys Glu Ala Lys Val Ser 

340 345 350 



Lys Arg Glu Lys Ala Val Trp Val Leu Asn Pro Glu Ala Gly Met Trp 
355 360 365 

Gin Cys Leu Leu Ser Asp Ser Gly Gin Val Leu Leu Glu Ser Asn He 
370 375 380 



Lys Val Leu Pro Thr Trp Ser Thr Pro Val Gin Pro Met Ala Leu He 
65 385 390 395 400 

Val Leu Gly Gly Val Ala Gly Leu Leu Leu Phe He Gly Leu Gly He 
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405 410 415 

Phe Phe Cys Val Arg Cys Arg His Arg Arg Arg Gin Ala Glu Arg Met 
420 425 430 

5 

Ser Gin lie Lys Arg Leu Leu Ser Glu Lys Lys Glu Cys Gin Cys Pro 
435 440 445 

His Arg Phe Gin Lys Thr Cys Ser Pro lie 
10 450 455 

(2) INFORMATION FOR SEQ ID NO: 26: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: peptide 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO:26: 

25 Met Asn Ser Gly Val Ala Met Lys Tyr Gly Asn Asp Ser Ser Ala Glu 

15 10 15 

Leu Ser Glu Leu His Ser Ala Ala Leu Ala Ser Leu Lys Gly Asp lie 
20 25 30 



30 



45 



60 



Val Glu Leu Asn Lys Arg Leu Gin Gin Thr Glu Arg Glu Asp Leu Leu 
35 40 45 



Glu Lys Lys Leu Ala Lys Ala Gin Cys Glu Gin Ser His Leu Met Arg 
35 50 55 60 

Glu His Glu Asp Val Gin Glu Arg Thr Thr Leu Arg Tyr Glu Glu Arg 
65 70 75 80 

40 He Thr Glu Leu His Ser Val He Ala Glu Leu Asn Lys Lys He Asp 

85 90 95 



Arg Leu Gin Gly Thr Thr He Arg Glu Glu Asp Glu Tyr Ser Glu Leu 
100 105 110 

Arg Ser Glu Leu Ser Gin Ser Gin His Glu Val Asn Glu Asp Ser Arg 
115 120 125 



Ser Met Asp Gin Asp Gin Thr Ser Val Ser He Pro Glu Asn Gin Ser 
50 130 135 140 

Thr Met Val Thr Ala Asp Met Asp Asn Cys Ser Asp He Asn Ser Glu 
145 150 155 160 

55 Leu Gin Arg Val Leu Thr Gly Leu Glu Asn Val Val Cys Gly Arg Lys 

165 170 175 



Lys Ser Ser Cys Ser Leu Ser Val Ala Glu Val Asp Arg His He Glu 
180 185 190 

Gin Leu Thr Thr Ala Ser Glu His Cys Asp Leu Ala He Lys Thr Val 

195 200 205 



Glu Glu lie Glu Gly Val Leu Gly Arg Asp Leu Tyr Pro Asn Leu Ala 
65 210 215 220 

Glu Glu Arg Ser Arg Trp Glu Lys Glu Leu Ala Gly Leu Arg Glu Glu 
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225 230 235 240 

Asn Glu Ser Leu Thr Ala Met Leu Cys Ser Lys Glu Glu Glu Leu Asn 
245 250 255 

Arg Thr Lys Ala Thr Met Asn Ala lie Arg Glu Glu Arg Asp Arg Leu 
260 265 270 

Arg Arg Arg Val Arg Glu Leu Gin Thr Arg Leu Gin Ser Val Gin Ala 
275 280 285 

Thr Gly Pro Ser Ser Pro Gly Arg Leu Thr Ser Thr Asn Arg Pro lie 
290 295 300 

Asn Pro Ser Thr Gly Glu Leu Ser Thr Ser Ser Ser Ser Asn Asp lie 
305 310 315 320 

Pro lie Ala Lys lie Ala Glu Arg Val Lys Leu Ser Lys Thr Arg Ser 
325 330 335 

Glu Ser Ser Ser Ser Asp Arg Pro Val Leu Gly Ser Glu lie Ser Ser 
340 345 350 

He Gly Val Ser Ser Ser Val Ala Glu His Leu Ala His Ser Leu Gin 
355 360 365 

Asp Cys Ser Asn He Gin Glu He Phe Gin Thr Leu Tyr Ser His Gly 
370 375 380 

Ser Ala He Ser Glu Ser Lys He Arg Glu Phe Glu Val Glu Thr Glu 
385 390 395 400 

Arg Leu Asn Ser Arg He Glu His Leu Lys Ser Gin Asn Asp Leu Leu 
405 410 415 

Thr He Thr Leu Glu Glu Cys Lys Ser Asn Ala Glu Arg Met Ser Met 
420 425 430 

Leu Val Gly Lys Tyr Glu Ser Asn Ala Thr Ala Leu Arg Leu Ala Leu 
435 440 445 

Gin Tyr Ser Glu Gin Cys He Glu Ala Tyr Glu Leu Leu Leu Ala Leu 
450 455 460 

Ala Glu Ser Glu Gin Ser Leu He Leu Gly Gin Phe Arg Ala Ala Gly 
465 470 475 480 

Val Gly Ser Ser Pro Gly Asp Gin Ser Gly Asp Glu Asn He Thr Gin 
485 490 495 

Met Leu Lys Arg Ala His Asp Cys Arg Lys Thr Ala Glu Asn Ala Ala 
500 505 510 

Lys Ala Leu Leu Met Lys Leu Asp Gly Ser Cys Gly Gly Ala Phe Ala 
515 520 525 

Val Ala Gly Cys Ser Val Gin Pro Trp Glu Ser Leu Ser Ser Asn Ser 
530 535 540 

His Thr Ser Thr Thr Ser Ser Thr Ala Ser Ser Cys Asp Thr Glu Phe 
545 550 555 560 

Thr Lys Glu Asp Glu Gin Arg Leu Lys Asp Tyr He Gin Gin Leu Lys 
565 570 575 

Asn Asp Arg Ala Ala Val Lys Leu Thr Met Leu Glu Leu Glu Ser He 
580 585 590 
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His lie Asp Pro Leu Ser Tyr Asp Val Lys Pro Arg Gly Asp Ser Gin 
595 600 605 

Arg Leu Asp Leu Glu Asn Ala Val Leu Met Gin Glu Leu Met Ala Met 
5 610 615 620 

Lys Glu Glu Met Ala Glu Leu Lys Ala Gin Leu Tyr Leu Leu Glu Lys 
625 630 635 640 

10 Glu Lys Lys Ala Leu Glu Leu Lys Leu Ser Thr Arg Glu Ala Gin Glu 

645 650 655 



15 



30 



45 



Gin Ala Tyr Leu Val His He Glu His Leu Lys Ser Glu Val Glu Glu 
660 665 670 

Gin Lys Glu Gin Arg Met Arg Ser Leu Ser Ser Thr Ser Ser Gly Ser 
675 680 685 



Lys Asp Lys Pro Gly Lys Glu Cys Ala Asp Ala Ala Ser Pro Ala Leu 

20 690 695 700 

Ser Leu Ala Glu Leu Arg Thr Thr Cys Ser Glu Asn Glu Leu Ala Ala 
705 710 715 720 

25 Glu Phe Thr Asn Ala He Arg Arg Glu Lys Lys Leu Lys Ala Arg Val 

725 730 735 



Gin Glu Leu Val Ser Ala Leu Glu Arg Leu Thr Lys Ser Ser Glu He 
740 745 750 

Arg His Gin Gin Ser Ala Glu Phe Val Asn Asp Leu Lys Arg Ala Asn 
755 760 765 



Ser Asn Leu Val Ala Ala Tyr Glu Lys Ala Lys Lys Lys His Gin Asn 
35 770 775 780 

Lys Leu Lys Lys Leu Glu Ser Gin Met Met Ala Met Val Glu Arg His 
785 790 795 800 

40 Glu Thr Gin Val Arg Met Leu Lys Gin Arg He Ala Leu Leu Glu Glu 

805 810 815 



Glu Asn Ser Arg Pro His Thr Asn Glu Thr Ser Leu 
820 825 



(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 672 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Met Ala Asp Val Phe Pro Gly Asn Asp Ser Thr Ala Ser Gin Asp Val 
60 1 5 10 15 

Ala Asn Arg Phe Ala Arg Lys Gly Ala Leu Arg Gin Lys Asn Val His 
20 25 30 

65 Glu Val Lys Asp His Lys Phe He Ala Arg Phe Phe Lys Gin Pro Thr 

35 40 45 
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Phe Cys Ser His Cys Thr Asp Phe lie Trp Gly Phe Gly Lys Gly Gly 
50 55 60 

Phe Gin Cys Gin Val Cys Cys Phe Val Val His Lys Arg Cys His Glu 
65 70 75 80 

Phe Val Thr Phe Ser Cys Pro Gly Ala Asp Lys Gly Pro Asp Thr Asp 
85 90 95 

Asp Pro Arg Ser Lys His Lys Phe Lys lie His Thr Tyr Gly Ser Pro 
100 105 110 

Thr Phe Cys Asp His Cys Gly Ser Leu Leu Tyr Gly Leu He His Gin 
115 120 125 

Gly Met Lys Cys Asp Thr Cys Asp Met Asn Val His Lys Gin Cys Val 
130 135 140 

He Asn Val Pro Ser Leu Cys Gly Met Asp His Thr Glu Lys Arg Gly 
145 150 155 160 

Arg He Tyr Leu Lys Ala Glu Val Ala Asp Glu Lys Leu His Val Thr 
165 170 175 

Val Arg Asp Ala Lys Asn Leu He Pro Met Asp Pro Asn Gly Leu Ser 
180 185 190 

Asp Pro Tyr Val Lys Leu Lys Leu He Pro Asp Pro Lys Asn Glu Ser 
195 200 205 

Lys Gin Lys Thr Lys Thr He Arg Ser Thr Leu Asn Pro Gin Trp Asn 
210 215 220 

Glu Ser Phe Thr Phe Lys Leu Lys Pro Ser Asp Lys Asp Arg Arg Leu 
225 230 235 240 

Ser Val Glu He Trp Asp Trp Asp Arg Thr Thr Arg Asn Asp Phe Met 
245 250 255 

Gly Ser Leu Ser Phe Gly Val Ser Glu Leu Met Lys Met Pro Ala Ser 
260 265 270 

Gly Trp Tyr Lys Leu Leu Asn Gin Glu Glu Gly Glu Tyr Tyr Asn Val 
275 280 285 

Pro lie Pro Glu Gly Asp Glu Glu Gly Asn Met Glu Leu Arg Gin Lys 
290 295 300 

Phe Glu Lys Ala Lys Leu Gly Pro Ala Gly Asn Lys Val He Ser Pro 
305 310 315 320 

Ser Glu Asp Arg Lys Gin Pro Ser Asn Asn Leu Asp Arg Val Lys Leu 
325 330 335 

Thr Asp Phe Asn Phe Leu Met Val Leu Gly Lys Gly Ser Phe Gly Lys 
340 345 350 

Val Met Leu Ala Asp Arg Lys Gly Thr Glu Glu Leu Tyr Ala He Lys 
355 360 365 

He Leu Lys Lys Asp Val Val He Gin Asp Asp Asp Val Glu Cys Thr 
370 375 380 

Met Val Glu Lys Arg Val Leu Ala Leu Leu Asp Lys Pro Pro Phe Leu 
385 390 395 400 

Thr Gin Leu His Ser Cys Phe Gin Thr Val Asp Arg Leu Tyr Phe Val 
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405 410 415 

Met Glu Tyr Val Asn Gly Gly Asp Leu Met Tyr His He Gin Gin Val 
420 425 430 

5 Gly Lys Phe Lys Glu Pro Gin Ala Val Phe Tyr Ala Ala Glu He Ser 

435 440 445 

He Gly Leu Phe Phe Leu His Lys Arg Gly He He Tyr Arg Asp Leu 
10 450 455 460 

Lys Leu Asp Asn Val Met Leu Asp Ser Glu Gly His He Lys He Ala 
465 470 475 480 

15 asp Phe Gly Met Cys Lys Glu His Met Met Asp Gly Val Thr Thr Arg 

* 485 490 495 



20 



35 



50 



60 



Thr Phe Cys Gly Thr Pro Asp Tyr He Ala Pro Glu He He Ala Tyr 
500 505 510 

Gin Pro Tyr Gly Lys Ser Val Asp Trp Trp Ala Tyr Gly Val Leu Leu 
515 520 525 



Tyr Glu Met Leu Ala Gly Gin Pro Pro Phe Asp Gly Glu Asp Glu Asp 
25 530 535 540 

Glu Leu Phe Gin Ser He Met Glu His Asn Val Ser Tyr Pro Lys Ser 
545 550 555 560 

3 0 Leu Ser Lys Glu Ala Val Ser He Cys Lys Gly Leu Met Thr Lys His 

565 570 575 



Pro Ala Lys Arg Leu Gly Cys Gly Pro Glu Gly Glu Arg Asp Val Arg 
580 585 590 

Glu His Ala Phe Phe Arg Arg He Asp Trp Glu Lys Leu Glu Asn Arg 
595 600 605 

Glu He Gin Pro Pro Phe Lys Pro Lys Val Cys Gly Lys Gly Ala Glu 
40 610 615 620 

Asn Phe Asp Lys Phe Phe Thr Arg Gly Gin Pro Val Leu Thr Pro Pro 
625 630 635 640 

45 Asp Gin Leu Val He Ala Asn He Asp Gin Ser Asp Phe Glu Gly Phe 

645 650 655 

Ser Tyr Val Asn Pro Gin Phe Val His Pro He Leu Gin Ser Ala Val 
660 665 670 



(2) INFORMATION FOR SEQ ID NO: 28: 



55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 



65 M et Asp He Leu Cys Glu Glu Asn Thr Ser Leu S r Ser Thr Thr Asn 

1 5 10 15 



10 



15 
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Ser Leu Met Gin Leu Asn Asp Asp Thr Arg Leu Tyr Ser Asn Asp Phe 
20 25 30 

Asn Ser Gly Glu Ala Asn Thr Ser Asp Ala Phe Asn Trp Thr Val Asp 
b 35 40 45 

Ser Glu Asn Arg Thr Asn Leu Ser Cys Glu Gly Cys Leu Ser Pro Ser 
50 55 eo 

Cys Leu Ser Leu Leu His Leu Gin Glu Lys Asn Trp Ser Ala Leu Leu 
65 ™ 75 so 

Thr Ala Val Val He He Leu Thr He Ala Gly Asn He Leu Val He 
85 90 95 

Met Ala Val Ser Leu Glu Lys Lys Leu Gin Asn Ala Thr Asn Tyr Phe 
10 ° 105 110 

on Leu Met Ser Leu A1 * He Ala Asp Met Leu Leu Gly Phe Leu Val Met 

ZU 115 120 125 

Pro Val Ser Met Leu Thr He Leu Tyr Gly Tyr Arg Trp Pro Leu Pro 
"0 135 140 

Ser Lys Leu Cys Ala Val Trp He Tyr Leu Asp Val Leu Phe Ser Thr 
145 iSO 155 i 6 o 

Ala Ser He Met His Leu Cys Ala He Ser Leu Asp Arg Tyr Val Ala 
165 170 i 7 5 

He Gin Asn Pro He His His Ser Arg Phe Asn Ser Arg Thr Lys Ala 
180 185 190 

_ c phe Leu L YS He He Ala Val Trp Thr He Ser Val Gly He Ser Met 

35 195 200 205 

Pro He Pro Val Phe Gly Leu Gin Asp Asp Ser Lys Val Phe Lys Glu 
210 215 220 

Gly Ser Cys Leu Leu Ala Asp Asp Asn Phe Val Leu He Gly Ser Phe 
225 230 235 240 

Val Ser Phe Phe He Pro Leu Thr He Met Val He Thr Tyr Phe Leu 
245 250 255 

Thr He Lys Ser Leu Gin Lys Glu Ala Thr Leu Cys Val Ser Asp Leu 
260 265 270 

cn G1 V Thr 2*9 Ala Lys Leu Ala Ser Phe Ser Phe Leu Pro Gin Ser Ser 

bU 275 280 285 

Leu Ser Ser Glu Lys Leu Phe Gin Arg Ser He His Arg Glu Pro Gly 
290 295 300 

55 Ser Tyr Thr Gly Arg Arg Thr Met Gin Ser He Ser Asn Glu Gin Lys 

305 310 315 320 

Ala Cys Lys Val Leu Gly He Val Phe Phe Leu Phe Val Val Met Trp 
325 330 335 



25 



30 



40 



45 



60 



Cys Pro Phe Phe lie Thr Asn He Met Ala Val He Cys Lys Glu Ser 
340 345 350 

Cys Asn Glu Asp Val He Gly Ala Leu Leu Asn Val Phe Val Trp He 
65 355 360 365 

Gly Tyr Leu Ser Ser Ala Val Asn Pro Leu Val Tyr Thr Leu Phe Asn 
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370 375 380 

Lys Thr Tyr Arg Ser Ala Phe Ser Arg Tyr lie Gin Cys Gin Tyr Lys 
385 390 395 400 

5 

Glu Asn Lys Lys Pro Leu Gin Leu lie Leu Val Asn Thr He Pro Ala 
405 410 415 

Leu Ala Tyr Lys Ser Ser Gin Leu Gin Met Gly Gin Lys Lys Asn Ser 
10 420 425 430 

Lys Gin Asp Ala Lys Thr Thr Asp Asn Asp Cys Ser Met Val Ala Leu 
435 440 445 

15 Gly Lys Gin His Ser Glu Glu Ala Ser Lys Asp Asn Ser Asp Gly Val 

450 455 460 



20 



45 



60 



Asn Glu Lys Val Ser Cys Val 
465 470 



(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 481 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNE5S : single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Ala Leu Ser Tyr Arg Val Ser Glu Leu Gin Ser Thr lie Pro Glu 
35 1 5 10 15 

His lie Leu Gin Ser Thr Phe Val His Val He Ser Ser Asn Trp Ser 
20 25 30 

40 Gly Leu Gin Thr Glu Ser He Pro Glu Glu Met Lys Gin He Val Glu 

35 40 45 



Glu Gin Gly Asn Lys Leu His Trp Ala Ala Leu Leu He Leu Met Val 
50 55 60 

He He Pro Thr He Gly Gly Asn Thr Leu Val He Leu Ala Val Ser 
65 70 75 80 

Leu Glu Lys Lys Leu Gin Tyr Ala Thr Asn Tyr Phe Leu Met Ser Leu 
50 85 90 95 

Ala Val Ala Asp Leu Leu Val Gly Leu Phe Val Met Pro He Ala Leu 
100 105 HO 

55 Leu Thr He Met Phe Glu Ala Met Trp Pro Leu Pro Leu Val Leu Cys 

115 120 125 



Pro Ala Trp Leu Phe Leu Asp Val Leu Phe Ser Thr Ala Ser He Met 
130 135 140 

His Leu Cys Ala He Ser Val Asp Arg Tyr He Ala He Lys Lys Pro. 

145 150 155 160 

He Gin Ala Asn Gin Tyr Asn Ser Arg Ala Thr Ala Phe He Lys He 

65 165 170 175 

Thr Val Val Trp Leu He Ser He Gly He Ala He Pro Val Pro He 
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160 185 190 

Lys Gly He Glu Thr Asp Val Asp Asn Pro Asn Asn He Thr Cys Val 
195 200 205 

Leu Thr Lys Glu Arg Phe Gly Asp Phe Met Leu Phe Gly Ser Leu Ala 
210 215 220 

Ala Phe Phe Thr Pro Leu Ala He Met He Val Thr Tyr Phe Leu Thr 
10 225 230 235 240 

He His Ala Leu Gin Lys Lys Ala Tyr Leu Val Lys Asn Lys Pro Pro 
245 250 255 

15 Gin Arg Leu Thr Trp Leu Thr Val Ser Thr Val Phe Gin Arg Asp Glu 

260 265 270 



20 



35 



50 



Thr Pro Cys Ser Ser Pro Glu Lys Val Ala Met Leu Asp Gly Ser Arg 
275 280 285 

Lys Asp Lys Ala Leu Pro Asn Ser Gly Asp Glu Thr Leu Met Arg Arg 
290 295 300 



Thr Ser Thr He Gly Lys Lys Ser Val Gin Thr He Ser Asn Glu Gin 

25 305 310 315 320 

Arg Ala Ser Lys Val Leu Gly He Val Phe Phe Leu Phe Leu Leu Met 
325 330 335 

3 0 Trp Cys Pro Phe Phe He Thr Asn He Thr Leu Val Leu Cys Asp Ser 

340 345 350 



Cys Asn Gin Thr Thr Leu Gin Met Leu Leu Glu He Phe Val Trp He 
355 360 365 

Gly Tyr Val Ser Ser Gly Val Asn Pro Leu Val Tyr Thr Leu Phe Asn 
370 375 380 



Lys Thr Phe Arg Asp Ala Phe Gly Arg Tyr He Thr Cys Asn Tyr Arg 
40 385 390 395 400 

Ala Thr Lys Ser Val Lys Thr Leu Arg Lys Arg Ser Ser Lys He Tyr 

405 410 415 

45 Phe Arg Asn Pro Met Ala Glu Asn Ser Lys Phe Phe Lys Lys His Gly 

420 425 430 



He Arg Asn Gly He Asn Pro Ala Met Tyr Gin Ser Pro Met Arg Leu 
435 440 445 

Arg Ser Ser Thr He Gin Ser Ser Ser He He Leu Leu Asp Thr Leu 
450 455 460 



Leu Leu Thr Glu Asn Glu Gly Asp Lys Thr Glu Glu Gin Val Ser Val 
55 465 470 475 480 

Val 

60 (2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2843 amino acids 

(B) TYPE: amino acid 

65 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Ala Ala Ala Ser Tyr Asp Gin Leu Leu Lys Gin Val Glu Ala Leu 
15 10 15 

Lys Met Glu Asn Ser Asn Leu Arg Gin Glu Leu Glu Asp Asn Ser Asn 
20 25 30 

His Leu Thr Lys Leu Glu Thr Glu Ala Ser Asn Met Lys Glu Val Leu 
35 40 45 

Lys Gin Leu Gin Gly Ser lie Glu Asp Glu Ala Met Ala Ser Ser Gly 
50 55 60 

Gin lie Asp Leu Leu Glu Arg Leu Lys Glu Leu Asn Leu Asp Ser Ser 
65 70 75 80 

Asn Phe Pro Gly Val Lys Leu Arg Ser Lys Met Ser Leu Arg Ser Tyr 
85 90 95 

Gly Ser Arg Glu Gly Ser Val Ser Ser Arg Ser Gly Glu Cys Ser Pro 
100 105 110 

Val Pro Met Gly Ser Phe Pro Arg Arg Gly Phe Val Asn Gly Ser Arg 
115 120 125 

Glu Ser Thr Gly Tyr Leu Glu Glu Leu Glu Lys Glu Arg Ser Leu Leu 
130 135 140 

Leu Ala Asp Leu Asp Lys Glu Glu Lys Glu Lys Asp Trp Tyr Tyr Ala 
145 150 155 160 

Gin Leu Gin Asn Leu Thr Lys Arg lie Asp Ser Leu Pro Leu Thr Glu 
165 170 175 

Asn Phe Ser Leu Gin Thr Asp Met Thr Arg Arg Gin Leu Glu Tyr Glu 
180 185 190 

Ala Arg Gin He Arg Val Ala Met Glu Glu Gin Leu Gly Thr Cys Gin 
195 200 205 

Asp Met Glu Lys Arg Ala Gin Arg Arg He Ala Arg He Gin Gin He 
210 215 220 

Glu Lys Asp He Leu Arg He Arg Gin Leu Leu Gin Ser Gin Ala Thr 
225 230 235 240 

Glu Ala Glu Arg Ser Ser Gin Asn Lys His Glu Thr Gly Ser His Asp 
245 250 255 

Ala Glu Arg Gin Asn Glu Gly Gin Gly Val Gly Glu He Asn Met Ala 
260 265 270 

Thr Ser Gly Asn Gly Gin Gly Ser Thr Thr Arg Met Asp His Glu Thr 
275 280 285 

Ala Ser Val Leu Ser Ser Ser Ser Thr His Ser Ala Pro Arg Arg Leu 
290 295 300 

Thr Ser His Leu Gly Thr Lys Val Glu Met Val Tyr Ser Leu Leu Ser 
305 310 315 320 

Met Leu Gly Thr His Asp Lys Asp Asp Met Ser Arg Thr Leu Leu Ala 
325 330 335 
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Met Ser Ser Ser Gin Asp Ser Cys lie Ser Met Arg Gin Ser Gly Cys 
340 345 350 

Leu Pro Leu Leu lie Gin Leu Leu His Gly Asn Asp Lys Asp Ser Val 
355 360 365 

Leu Leu Gly Asn Ser Arg Gly Ser Lys Glu Ala Arg Ala Arg Ala Ser 
370 375 380 

Ala Ala Leu His Asn lie lie His Ser Gin Pro Asp Asp Lys Arg Gly 
385 390 395 400 

Arg Arg Glu lie Arg Val Leu His Leu Leu Glu Gin lie Arg Ala Tyr 
405 410 415 

Cys Ser Thr Cys Trp Glu Trp Gin Glu Ala His Glu Pro Gly Met Asp 
420 425 430 

Gin Asp Lys Asn Pro Met Pro Ala Pro Val Glu His Gin lie Cys Pro 
435 440 445 

Ala Val Cys Val Leu Met Lys Leu Ser Phe Asp Glu Glu His Arg His 
450 455 460 

Ala Met Asn Glu Leu Gly Gly Leu Gin Ala He Ala Glu Leu Leu Gin 
465 470 475 480 

Val Asp Cys Glu Met Tyr Gly Leu Thr Asn Asp His Tyr Ser He Thr 
485 490 495 

Leu Arg Arg Tyr Ala Gly Met Ala Leu Thr Asn Leu Thr Phe Gly Asp 
500 505 510 

Val Ala Asn Lys Ala Thr Leu Cys Ser Met Lys Gly Cys Met Arg Ala 
515 520 525 

Leu Val Ala Gin Leu Lys Ser Glu Ser Glu Asp Leu Gin Gin Val He 
530 535 540 

Ala Ser Val Leu Arg Asn Leu Ser Trp Arg Ala Asp Val Asn Ser Lys 
545 550 555 560 

Lys Thr Leu Arg Glu Val Gly Ser Val Lys Ala Leu Met Glu Cys Ala 
565 570 575 

Leu Glu Val Lys Lys Glu Ser Thr Leu Lys Ser Val Leu Ser Ala Leu 
580 585 590 

Trp Asn Leu Ser Ala His Cys Thr Glu Asn Lys Ala Asp He Cys Ala 
595 600 605 

Val Asp Gly Ala Leu Ala Phe Leu Val Gly Thr Leu Thr Tyr Arg Ser 
610 615 620 

Gin Thr Asn Thr Leu Ala He He Glu Ser Gly Gly Gly He Leu Arg 
625 630 635 640 

Asn Val Ser Ser Leu He Ala Thr Asn Glu Asp His Arg Gin He Leu 
645 650 655 

Arg Glu Asn Asn Cys Leu Gin Thr Leu Leu Gin His Leu Lys Ser His 
660 665 670 

Ser Leu Thr He Val Ser Asn Ala Cys Gly Thr Leu Trp Asn Leu Ser 
675 680 685 

Ala Arg Asn Pro Lys Asp Gin Glu Ala Leu Trp Asp Met Gly Ala Val 
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690 695 700 

Ser Met Leu Lys Asn Leu lie His Ser Lys His Lys Met He Ala Met 
705 710 715 720 

Gly Ser Ala Ala Ala Leu Arg Asn Leu Met Ala Asn Arg Pro Ala Lys 
725 730 735 

Tyr Lys Asp Ala Asn He Met Ser Pro Gly Ser Ser Leu Pro Ser Leu 
740 745 750 

His Val Arg Lys Gin Lys Ala Leu Glu Ala Glu Leu Asp Ala Gin His 
755 760 765 

Leu Ser Glu Thr Phe Asp Asn He Asp Asn He Ser Pro Lys Ala Ser 
770 775 780 

His Arg Ser Lys Gin Arg His Lys Gin Ser Leu Tyr Gly Asp Tyr Val 
785 790 795 800 

Phe Asp Thr Asn Arg His Asp Asp Asn Arg Ser Asp Asn Phe Asn Thr 
805 810 815 

Gly Asn Met Thr Val Leu Ser Pro Tyr Leu Asn Thr Thr Val Leu Pro 
820 825 830 

Ser Ser Ser Ser Ser Arg Gly Ser Leu Asp Ser Ser Arg Ser Glu Lys 
835 840 845 

Asp Arg Ser Leu Glu Arg Glu Arg Gly He Gly Leu Gly Asn Tyr His 
850 855 860 

Pro Ala Thr Glu Asn Pro Gly Thr Ser Ser Lys Arg Gly Leu Gin lie 
865 870 875 880 

Ser Thr Thr Ala Ala Gin He Ala Lys Val Met Glu Glu Val Ser Ala 
885 890 895 

He His Thr Ser Gin Glu Asp Arg Ser Ser Gly Ser Thr Thr Glu Leu 
900 905 910 

His Cys Val Thr Asp Glu Arg Asn Ala Leu Arg Arg Ser Ser Ala Ala 
915 920 925 

His Thr His Ser Asn Thr Tyr Asn Phe Thr Lys Ser Glu Asn Ser Asn 
930 935 940 

Arg Thr Cys Ser Met Pro Tyr Ala Lys Leu Glu Tyr Lys Arg Ser Ser 
945 950 955 960 

Asn Asp Ser Leu Asn Ser Val Ser Ser Ser Asp Gly Tyr Gly Lys Arg 
965 970 975 

Gly Gin Met Lys Pro Ser He Glu Ser Tyr Ser Glu Asp Asp Glu Ser 
980 985 990 

Lys Phe Cys Ser Tyr Gly Gin Tyr Pro Ala Asp Leu Ala His Lys He 
995 1000 1005 

His Ser Ala Asn His Met Asp Asp Asn Asp Gly Glu Leu Asp Thr Pro 
1010 1015 1020 

lie Asn Tyr Ser Leu Lys Tyr Ser Asp Glu Gin Leu Asn Ser Gly Arg 
1025 1030 1035 1040 

Gin Ser Pro Ser Gin Asn Glu Arg Trp Ala Arg Pro Lys His He He 
1045 1050 1055 
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Glu Asp Glu He Lys Gin Ser Glu Gin Arg Gin Ser Arg Asn Gin Ser 
1060 1065 1070 

Thr Thr Tyr Pro Val Tyr Thr Glu Ser Thr Asp Asp Lys His Leu Lys 
5 1075 1080 1085 

Phe Gin Pro His Phe Gly Gin Gin Glu Cys Val Ser Pro Tyr Arg Ser 
1090 1095 1100 

10 Arg Gly Ala Asn Gly Ser Glu Thr Asn Arg Val Gly Ser Asn His Gly 

1105 1110 1115 1120 



15 



30 



45 



60 



He Asn Gin Asn Val Ser Gin Ser Leu Cys Gin Glu Asp Asp Tyr Glu 
1125 1130 1135 

Asp Asp Lys Pro Thr Asn Tyr Ser Glu Arg Tyr Ser Glu Glu Glu Gin 
1140 1145 1150 



His Glu Glu Glu Glu Arg Pro Thr Asn Tyr Ser He Lys Tyr Asn Glu 
20 1155 1160 1165 

Glu Lys Arg His Val Asp Gin Pro He Asp Tyr Ser He Leu Lys Ala 
1170 1175 1180 

25 Thr Asp He Pro Ser Ser Gin Lys Gin Ser Phe Ser Phe Ser Lys Ser 

1185 1190 1195 1200 



Ser Ser Gly Gin Ser Ser Lys Thr Glu His Met Ser Ser Ser Ser Glu 
1205 1210 1215 

Asn Thr Ser Thr Pro Ser Ser Asn Ala Lys Arg Gin Asn Gin Leu His 
1220 1225 1230 



Pro Ser Ser Ala Gin Ser Arg Ser Gly Gin Pro Gin Lys Ala Ala Thr 
35 1235 1240 1245 

Cys Lys Val Ser Ser He Asn Gin Glu Thr He Gin Thr Tyr Cys Val 
1250 1255 1260 

40 Glu Asp Thr Pro He Cys Phe Ser Arg Cys Ser Ser Leu Ser Ser Leu 

1265 1270 1275 1280 



Ser Ser Ala Glu Asp Glu He Gly Cys Asn Gin Thr Thr Gin Glu Ala 
1285 1290 1295 

Asp Ser Ala Asn Thr Leu Gin He Ala Glu He Lys Glu Lys He Gly 
1300 1305 1310 



Thr Arg Ser Ala Glu Asp Pro Val Ser Glu Val Pro Ala Val Ser Gin 
50 1315 1320 1325 

His Pro Arg Thr Lys Ser Ser Arg Leu Gin Gly Ser Ser Leu Ser Ser 
1330 1335 1340 

55 Glu Ser Ala Arg His Lys Ala Val Glu Phe Ser Ser Gly Ala Lys Ser 

1345 1350 1355 1360 



Pro Ser Lys Ser Gly Ala Gin Thr Pro Lys Ser Pro Pro Glu His Tyr 
1365 1370 1375 

Val Gin Glu Thr Pro Leu Met Phe Ser Arg Cys Thr Ser Val Ser Ser 

1380 1385 1390 



Leu Asp Ser Phe Glu Ser Arg Ser He Ala Ser Ser Val Gin Ser Glu 
65 1395 1400 1405 

Pro Cys Ser Gly Met Val Ser Gly He He Ser Pro Ser Asp Leu Pro 
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1410 1415 1420 

Asp Ser Pro Gly Gin Thr Met Pro Pro Ser Arg Ser Lys Thr Pro Pro 
1425 1430 1435 1440 

5 

Pro Pro Pro Gin Thr Ala Gin Thr Lys Arg Glu Val Pro Lys Asn Lys 
1445 1450 1455 

Ala Pro Thr Ala Glu Lys Arg Glu Ser Gly Pro Lys Gin Ala Ala Val 
10 1460 1465 1470 

Asn Ala Ala Val Gin Arg Val Gin Val Leu Pro Asp Ala Asp Thr Leu 

1475 1480 1485 

15 Leu His Phe Ala Thr Glu Ser Thr Pro Asp Gly Phe Ser Cys Ser Ser 

1490 1495 1500 



20 



35 



50 



65 



Ser Leu Ser Ala Leu Ser Leu Asp Glu Pro Phe lie Gin Lys Asp Val 
1505 1510 1515 1520 

Glu Leu Arg lie Met Pro Pro Val Gin Glu Asn Asp Asn Gly Asn Glu 
1525 1530 1535 



Thr Glu Ser Glu Gin Pro Lys Glu Ser Asn Glu Asn Gin Glu Lys Glu 
25 1540 1545 1550 

Ala Glu Lys Thr lie Asp Ser Glu Lys Asp Leu Leu Asp Asp Ser Asp 
1555 1560 1565 

30 Asp Asp Asp lie Glu lie Leu Glu Glu Cys He He Ser Ala Met Pro 

1570 1575 1580 



Thr Lys Ser Ser Arg Lys Ala Lys Lys Pro Ala Gin Thr Ala Ser Lys 
1585 1590 1595 1600 

Leu Pro Pro Pro Val Ala Arg Lys Pro Ser Gin Leu Pro Val Tyr Lys 
1605 1610 1615 



Leu Leu Pro Ser Gin Asn Arg Leu Gin Pro Gin Lys His Val Ser Phe 
40 1620 1625 1630 

Thr Pro Gly Asp Asp Met Pro Arg Val Tyr Cys Val Glu Gly Thr Pro 
1635 1640 1645 

45 He Asn Phe Ser Thr Ala Thr Ser Leu Ser Asp Leu Thr He Glu Ser 

1650 1655 1660 



Pro Pro Asn Glu Leu Ala Ala Gly Glu Gly Val Arg Gly Gly Ala Gin 
1665 1670 1675 1680 

Ser Gly Glu Phe Glu Lys Arg Asp Thr lie Pro Thr Glu Gly Arg Ser 
1685 1690 1695 



Thr Asp Glu Ala Gin Gly Gly Lys Thr Ser Ser Val Thr He Pro Glu 
55 1700 1705 1710 

Leu Asp Asp Asn Lys Ala Glu Glu Gly Asp He Leu Ala Glu Cys He 
1715 1720 1725 

60 Asn Ser Ala Met Pro Lys Gly Lys Ser His Lys Pro Phe Arg Val Lys 

1730 1735 1740 



Lys lie Met Asp Gin Val Gin Gin Ala Ser Ala Ser Ser Ser Ala Pro 
1745 1750 1755 1760 

Asn Lys Asn Gin Leu Asp Gly Lys Lys Lys Lys Pro Thr Ser Pro Val 
1765 1770 1775 
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Lys Pro lie Pro Gin Asn Thr Glu Tyr Arg Thr Arg Val Arg Lys Asn 
1780 1785 1790 

Ala Asp Ser Lys Asn Asn Leu Asn Ala Glu Arg Val Phe Ser Asp Asn 
1795 1800 1805 

Lys Asp Ser Lys Lys Gin Asn Leu Lys Asn Asn Ser Lys Asp Phe Asn 
1810 1815 1820 

Asp Lys Leu Pro Asn Asn Glu Asp Arg Val Arg Gly Ser Phe Ala Phe 
1825 1830 1835 1840 

Asp Ser Pro His His Tyr Thr Pro lie Glu Gly Thr Pro Tyr Cys Phe 
1845 1850 1855 

Ser Arg Asn Asp Ser Leu Ser Ser Leu Asp Phe Asp Asp Asp Asp Val 
1860 1865 1870 

Asp Leu Ser Arg Glu Lys Ala Glu Leu Arg Lys Ala Lys Glu Asn Lys 
1875 1880 1885 

Glu Ser Glu Ala Lys Val Thr Ser His Thr Glu Leu Thr Ser Asn Gin 
1890 1895 1900 

Gin Ser Ala Asn Lys Thr Gin Ala lie Ala Lys Gin Pro lie Asn Arg 
1905 1910 1915 1920 

Gly Gin Pro Lys Pro lie Leu Gin Lys Gin Ser Thr Phe Pro Gin Ser 
1925 1930 1935 

Ser Lys Asp lie Pro Asp Arg Gly Ala Ala Thr Asp Glu Lys Leu Gin 
1940 1945 1950 

Asn Phe Ala lie Glu Asn Thr Pro Val Cys Phe Ser His Asn Ser Ser 
1955 1960 1965 

Leu Ser Ser Leu Ser Asp He Asp Gin Glu Asn Asn Asn Lys Glu Asn 
1970 1975 1980 

Glu Pro He Lys Glu Thr Glu Pro Pro Asp Ser Gin Gly Glu Pro Ser 
1985 1990 1995 2000 

Lys Pro Gin Ala Ser Gly Tyr Ala Pro Lys Ser Phe His Val Glu Asp 
2005 2010 2015 

Thr Pro Val Cys Phe Ser Arg Asn Ser Ser Leu Ser Ser Leu Ser He 
2020 2025 2030 

Asp Ser Glu Asp Asp Leu Leu Gin Glu Cys He Ser Ser Ala Met Pro 
2035 2040 2045 

Lys Lys Lys Lys Pro Ser Arg Leu Lys Gly Asp Asn Glu Lys His Ser 
2050 2055 2060 

Pro Arg Asn Met Gly Gly lie Leu Gly Glu Asp Leu Thr Leu Asp Leu 
2065 2070 2075 2080 

Lys Asp He Gin Arg Pro Asp Ser Glu His Gly Leu Ser Pro Asp Ser 
2085 2090 2095 

Glu Asn Phe Asp Trp Lys Ala He Gin Glu Gly Ala Asn Ser He Val 
2100 2105 2110 

Ser Ser Leu His Gin Ala Ala Ala Ala Ala Cys Leu Ser Arg Gin Ala 
2115 2120 2125 

Ser Ser Asp Ser Asp Ser He Leu Ser Leu Lys Ser Gly He Ser Leu 
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2130 2135 2140 

Gly Ser Pro Phe His Leu Thr Pro Asp Gin Glu Glu Lys Pro Phe Thr 
2145 2150 2155 2160 

5 

Ser Asn Lys Gly Pro Arg lie Leu Lys Pro Gly Glu Lys Ser Thr Leu 
2165 2170 2175 

Glu Thr Lys Lys lie Glu Ser Glu Ser Lys Gly He Lys Gly Gly Lys 
10 2180 2185 2190 

Lys Val Tyr Lys Ser Leu He Thr Gly Lys Val Arg Ser Asn Ser Glu 
2195 2200 2205 

15 He Ser Gly Gin Met Lys Gin Pro Leu Gin Ala Asn Met Pro Ser He 

2210 2215 2220 



20 



35 



50 



65 



Ser Arg Gly Arg Thr Met He His He Pro Gly Val Arg Asn Ser Ser 
2225 2230 2235 2240 

Ser Ser Thr Ser Pro Val Ser Lys Lys Gly Pro Pro Leu Lys Thr Pro 

2245 2250 2255 



Ala Ser Lys Ser Pro Ser Glu Gly Gin Thr Ala Thr Thr Ser Pro Arg 
25 2260 2265 2270 

Gly Ala Lys Pro Ser Val Lys Ser Glu Leu Ser Pro Val Ala Arg Gin 
2275 2280 2285 

30 Thr Ser Gin He Gly Gly Ser Ser Lys Ala Pro Ser Arg Ser Gly Ser 

2290 2295 2300 



Arg Asp Ser Thr Pro Ser Arg Pro Ala Gin Gin Pro Leu Ser Arg Pro 
2305 2310 2315 2320 

He Gin Ser Pro Gly Arg Asn Ser He Ser Pro Gly Arg Asn Gly He 
2325 2330 2335 



Ser Pro Pro Asn Lys He Ser Gin Leu Pro Arg Thr Ser Ser Pro Ser 
40 2340 2345 2350 

Thr Ala Ser Thr Lys Ser Ser Gly Ser Gly Lys Met Ser Tyr Thr Ser 
2355 2360 2365 

45 Pro Gly Arg Gin Met Ser Gin Gin Asn Leu Thr Lys Gin Thr Gly Leu 

2370 2375 2380 



Ser Lys Asn Ala Ser Ser He Pro Arg Ser Glu Ser Ala Ser Lys Gly 
2385 2390 2395 2400 

Leu Asn Gin Met Asn Asn Gly Asn Gly Ala Asn Lys Lys Val Glu Leu 
2405 2410 2415 



Ser Arg Met Ser Ser Thr Lys Ser Ser Gly Ser Glu Ser Asp Arg Ser 
55 2420 2425 2430 

Glu Arg Pro Val Leu Val Arg Gin Ser Thr Phe He Lys Glu Ala Pro 
2435 2440 2445 

60 Ser Pro Thr Leu Arg Arg Lys Leu Glu Glu Ser Ala Ser Phe Glu Ser 

2450 2455 2460 



Leu Ser Pro Ser Ser Arg Pro Ala Ser Pro Thr Arg Ser Gin Ala Gin 
2465 2470 2475 2480 

Thr Pro Val Leu Ser Pro Ser Leu Pro Asp Met Ser Leu Ser Thr His 
2485 2490 2495 
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Ser Ser Val Gin Ala Gly Gly Trp Arg Lys Leu Pro Pro Asn Leu Ser 
2500 2505 2510 

Pro Thr lie Glu Tyr Asn Asp Gly Arg Pro Ala Lys Arg His Asp lie 
2515 2520 2525 

Ala Arg Ser His Ser Glu Ser Pro Ser Arg Leu Pro lie Asn Arg Ser 
2530 2535 2540 

Gly Thr Trp Lys Arg Glu His Ser Lys His Ser Ser Ser Leu Pro Arg 
2545 2550 2555 2560 

Val Ser Thr Trp Arg Arg Thr Gly Ser Ser Ser Ser lie Leu Ser Ala 
2565 2570 2575 

Ser Ser Glu Ser Ser Glu Lys Ala Lys Ser Glu Asp Glu Lys His Val 
2580 2585 2590 

Asn Ser lie Ser Gly Thr Lys Gin Ser Lys Glu Asn Gin Val Ser Ala 
2595 2600 2605 

Lys Gly Thr Trp Arg Lys lie Lys Glu Asn Glu Phe Ser Pro Thr Asn 
2610 2615 2620 

Ser Thr Ser Gin Thr Val Ser Ser Gly Ala Thr Asn Gly Ala Glu Ser 
2625 2630 2635 2640 

Lys Thr Leu lie Tyr Gin Met Ala Pro Ala Val Ser Lys Thr Glu Asp 
2645 2650 2655 

Val Trp Val Arg lie Glu Asp Cys Pro lie Asn Asn Pro Arg Ser Gly 
2660 2665 2670 

Arg Ser Pro Thr Gly Asn Thr Pro Pro Val lie Asp Ser Val Ser Glu 
2675 2680 2685 

Lys Ala Asn Pro Asn lie Lys Asp Ser Lys Asp Asn Gin Ala Lys Gin 
2690 2695 2700 

Asn Val Gly Asn Gly Ser Val Pro Met Arg Thr Val Gly Leu Glu Asn 
2705 2710 2715 2720 

Arg Leu Asn Ser Phe lie Gin Val Asp Ala Pro Asp Gin Lys Gly Thr 
2725 2730 2735 

Glu lie Lys Pro Gly Gin Asn Asn Pro Val Pro val Ser Glu Thr Asn 
2740 2745 2750 

Glu Ser Ser lie Val Glu Arg Thr Pro Phe Ser Ser Ser Ser Ser Ser 
2755 2760 2765 

Lys His Ser Ser Pro Ser Gly Thr Val Ala Ala Arg Val Thr Pro Phe 
2770 2775 2780 

Asn Tyr Asn Pro Ser Pro Arg Lys Ser Ser Ala Asp Ser Thr Ser Ala 
2785 2790 2795 2800 

Arg Pro Ser Gin lie Pro Thr Pro Val Asn Asn Asn Thr Lys Lys Arg 
2805 2810 2815 

Asp Ser Lys Thr Asp Ser Thr Glu Ser Ser Gly Thr Gin Ser Pro Lys 
2820 2825 2830 

Arg His Ser Gly Ser Tyr Leu Val Thr Ser Val 
2835 2840 



WO 98/05347 PCT/US97/12677 

-61- 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 65 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
{iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CGGAATTCNN NNNNNNNAAC AGCNNNNNNN NNAATGAANN NCAAAGTCTG NNNTGAGGAT 60 
CCTCA 6 5 



10 



20 



35 



40 



50 



(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

3 0 (ii) MOLECULE TYPE: other nucleic acid 

{iv) ANTI - SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CGGAATTCGA CTCAGAANNN NNNAACTTCA GANNNNNNAT CNNNNNNNNN GTCTGAGGAT 60 
CCTCA 65 



(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 65 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
55 CGGAATTCNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNTGAGGAT 60 

CCTCA 65 
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What is claimed is: 

1 . A composition capable of inhibiting specific binding 
5 between a signal -transducing protein and a 

cytoplasmic protein containing the amino acid 
sequence (G/S/A/E) -L-G- (F/I/L) , wherein each - 
represents a peptide bond, each parenthesis encloses 
amino acids which are alternatives to one other, and 
10 each slash within such parentheses separating the 

alternative amino acids. 

2. The composition of claim 1, wherein the cytoplasmic 
protein contains the amino acid sequence (K/R/Q) -X n - 

15 (G/S/A/E) -L-G- (F/I/L) , wherein X represents any 

amino acid which is selected from the group 
comprising the twenty naturally occurring amino 
acids and n represents at least 2, but not more than 
4 . 

20 

3. The composition of claim 1, wherein the cytoplasmic 
protein contains the amino acid sequence SLGI . 

4. The composition of claim 1, wherein the signal - 
25 transducing protein has at its carboxyl terminus the 

amino acid sequence (S/T) -X- (V/I/L) , wherein each - 
represents a peptide bond, each parenthesis encloses 
amino acids which are alternatives to one other, 
each slash within such parentheses separating the 
30 alternative amino acids, and the X represents any 

amino acid which is selected from the group 
comprising the twenty naturally occurring amino 
acids . 



35 



5. 



The composition of claim 1, wherein the composition 
comprises an antibody, an inorganic compound, an 
organic compound, a peptide, a peptidomimetic 



10 



15 
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compound, a polypeptide, or a protein. 

6. The composition of claim 5, wherein the peptide 
comprises the sequence (S/T) -X- (V/I/L) -COOH, wherein 
each - represents a peptide bond, each parenthesis 
encloses amino acids which are alternatives to one 
other, each slash within such parentheses separating 
the alternative amino acids, the X represents any 
amino acid which is selected from the group 
comprising the twenty naturally occurring amino 
acids. 

7. The composition of claim 6, wherein the peptide has 
the amino acid sequence DSENSNFRNEIQSLV. 

8. The composition of claim 6, wherein the peptide has 
the amino acid sequence RNEIQSLV. 

9. The composition of claim 6, wherein the peptide has 
20 the amino acid sequence NEIQSLV. 

10. The composition of claim 6, wherein the peptide has 
the amino acid sequence EIQSLV. 

25 11. The composition of claim 6, wherein the peptide has 

the amino acid sequence IQSLV. 

12. The composition of claim 6, wherein the peptide has 
the amino acid sequence QSLV. 

30 

13. The composition of claim 6, wherein the peptide has 
the amino acid sequence SLV. 

14. The composition of claim 6, wherein the peptide has 
35 the amino acid sequence IPPDSEDGNEEQSLV. 



15. 



The composition of claim 6, wherein the peptide has 
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the amino acid sequence DSEMYNFRSQLASW . 



The composition of claim 6, wherein the peptide has 
the amino acid sequence IDLASEFLFLSNSFL. 

The composition of claim 6, wherein the peptide has 
the amino acid sequence PPTCSQANSGRISTL. 

The composition of claim 6, wherein the peptide has 
the amino acid sequence SDSNMNMNELSEV . 

The composition of claim 6, wherein the peptide has 
the amino acid sequence QNFRTYIVSFV. 

The composition of claim 6, wherein the peptide has 
the amino acid sequence RETIESTV. 

The composition of claim 6, wherein the peptide has 
the amino acid sequence RGFISSLV, 

The composition of claim 6, wherein the peptide has 
the amino acid sequence TIQSVI . 

The composition of claim 6, wherein the peptide has 
the amino acid sequence ESLV. 

The composition of claim 6, wherein the organic 
compound has the sequence Ac-SLV-COOH, wherein the 
Ac represents an acetyl, each - represent a peptide 
bond. 

A composition capable of inhibiting specific binding 
between a signal -transducing protein having at its 
carboxyl terminus the amino acid sequence (S/T) -X- 
(V/I/L) , wherein each - represents a peptide bond, 
each parenthesis encloses amino acids which are 
alternatives to one other, each slash within such 
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parentheses separating the alternative amino acids, 
the X represents any amino acid which is selected 
from the group comprising the twenty naturally 
occurring amino acids, and a cytoplasmic protein. 

The composition of claim 25 # wherein the composition 
comprises an antibody, an inorganic compound, an 
organic compound, a peptide, a peptidomimetic 
compound, a polypeptide or a protein. 

A method of identifying a compound capable of 
inhibiting specific binding between a signal - 
transducing protein and a cytoplasmic protein 
containing the amino acid sequence (G/S/A/E) -L-G- 
(F/I/L) , wherein each - represents a peptide bond, 
each parenthesis encloses amino acids which are 
alternatives to one other, each slash within such 
parentheses separating the alternative amino acids, 
which comprises: 

(a) contacting the cytoplasmic protein bound to 
the signal -transducing protein with a 
plurality of compounds under conditions 
permitting binding between a known compound 
previously shown to be able to displace the 
signal -transducing protein bound to the 
cytoplasmic protein and the bound cytoplasmic 
protein to form a complex; and 

(b) detecting the displaced signal -transducing 
protein or the complex formed in step (a) , 
wherein the displacement indicates that the 
compound is capable of inhibiting specific 
binding between the signal -transducing protein 
and the cytoplasmic protein. 

The method of claim 27, wherein the inhibition of 
specific binding between the signal -transducing 
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protein and the cytoplasmic protein affects the 
transcription activity of a reporter gene. 

The method of claim 28, where in step (b) the 
displaced signal -transducing protein or the complex 
is detected by comparing the transcription activity 
of a reporter gene before and after the contacting 
with the compound in step (a) , where a change of the 
activity indicates that the specific binding between 
the signal -transducing protein and the cytoplasmic 
protein is inhibited and the signal-transducing 
protein is displaced. 

The method of claim 27, wherein the cytoplasmic 
protein is bound to a solid support. 

The method of claim 27, wherein the compound is 
bound to a solid support. 

The method of claim 27, wherein the compound 
comprises an antibody, an inorganic compound, an 
organic compound, a peptide, a peptidomimetic 
compound, a polypeptide or a protein. 

The method of claim 27, wherein the contacting of 
step (a) is in vitro . 

The method of claim 27, wherein the contacting of 
step (a) is in vivo . 

The method of claim 34, wherein the contacting of 
step (a) is in a yeast cell. 

The method of claim 34, wherein the contacting or 
step (a) is in a mammalian cell. 



The method of claim 27, wherein the signal- 
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transducing protein is a cell surface receptor. 

The method of claim 27, wherein the signal - 
transducing protein is a signal transducer protein. 

The method of claim 27, wherein the signal - 
transducing protein is a tumor suppressor protein. 

The method of claim 37, wherein the cell surface 
protein is the Fas receptor. 

The method of claim 40, wherein the Fas receptor is 
expressed in cells derived from organs comprising 
the thymus, liver, kidney, colon, ovary, breast, 
testis, spleen, stomach, prostate, uterus, skin, 
head and neck. 

The method of claim 40, wherein the Fas receptor is 
expressed in cells comprising T-cells and B-cells. 

The method of claim 37, wherein the cell-surface 
receptor is the CD4 receptor. 

The method of claim 37, wherein the cell-surface 
receptor is the p75 receptor. 

The method of claim 37, wherein the cell -surface 
receptor is the serotonin 2A receptor. 

The method of claim 37, wherein the cell -surface 
receptor is the serotonin 2B receptor. 

The method of claim 38, wherein the signal 
transducer protein is Protein Kinase-C-Qf-type . 

The method of claim 39, wherein the tumor suppressor 
protein is adenomatosis polyposis coli tumor 
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suppressor protein. 

49. The method of claim 39, wherein the tumor suppressor 
protein protein is the colorectal mutant cancer 

5 protein. 

50. The method of claim 27, wherein the cytoplasmic 
protein contains the amino acid sequence SLGI, 
wherein each - represents a peptide bond, each 

10 parenthesis encloses amino acids which are 

alternatives to one other, and each slash within 
such parentheses separating the alternative amino 
acids . 

15 51. The method of claim 40, wherein the cytoplasmic 

protein is Fas-associated phosphatase- 1 . 

52 . A method of identifying a compound capable of 
inhibiting specific binding between a signal- 

20 transducing protein having at its carboxyl terminus 

the amino acid sequence (S/T) -X- (V/I/L) , wherein 
each - represents a peptide bond, each parenthesis 
encloses amino acids which are alternatives to one 
other, each slash within such parentheses separating 

25 the alternative amino acids, the X represents any 

amino acid which is selected from the group 
comprising the twenty naturally occurring amino 
acids, and a cytoplasmic protein, which comprises: 

30 (a) contacting the signal -transducing protein 

bound to the cytoplasmic protein with a 
plurality of compounds under conditions 
permitting binding between a known compound 
previously shown to be able to displace the 

35 cytoplasmic protein bound to the signal - 

transducing protein and the bound signal - 
transducing protein to form a complex; and 
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<b) detecting the displaced cytoplasmic protein or 
the complex of step (a) wherein the 
displacement indicates that the compound is 
capable of inhibiting specific binding between 
5 the signal-transducing protein and the 

cytoplasmic protein. 

53. The method of claim 52, wherein the inhibition of 
specific binding between the signal- transducing 

10 protein and the cytoplasmic protein affects the 

transcription activity of a reporter gene. 

54. The method of claim 53, where in step (b) the 
displaced cytoplasmic protein or the complex is 

15 detected by comparing the transcription activity of 

a reporter gene before and after the contacting with 
the compound in step (a) , where a change of the 
activity indicates that the specific binding between 
the signal -transducing protein and the cytoplasmic 

20 protein is inhibited and the cytoplasmic protein is 

displaced. 

55. The method of claim 52, wherein the cytoplasmic 
protein is bound to a solid support . 

25 

56. The method of claim 52, wherein the compound is 
bound to a solid support. 

57. The method of claim 52, wherein the compound 
30 comprises an antibody, an inorganic compound, an 

organic compound, a peptide, a peptidomimetic 
compound, a polypeptide or a protein. 

58. The method of claim 52, wherein the contacting of 
35 step (a) is in vitro . 



59. 



The method of claim 52, wherein the contacting of 
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The method of claim 59, wherein the contacting of 
step (a) is in a yeast cell. 

The method of claim 59, wherein the contacting or 
step (a) is in a mammalian cell. 

The method of claim 52, wherein the signal - 
transducing protein is a cell surface receptor. 

The method of claim 52, wherein the signal - 
transducing protein is a signal transducer protein. 

The method of claim 52, wherein the signal- 
transducing protein is a tumor suppressor protein. 

The method of claim 62, wherein the cell surface 
protein is the Fas receptor. 

The method of claim 65, wherein the Fas receptor is 
expressed in cells derived from organs comprising 
the thymus, liver, kidney, colon, ovary, breast, 
testis, spleen, stomach, prostate, uterus, skin, 
head and neck. 

The method of claim 65, wherein the Fas receptor is 
expressed in cells comprising T-cells and B-cells. 

The method of claim 62, wherein the cell-surface 
receptor is the CD4 receptor. 

The method of claim 62, wherein the cell-surface 
receptor is the p75 receptor. 

The method of claim 62, wherein the cell-surface 
receptor is the serotonin 2A receptor. 
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71. The method of claim 62, wherein the cell-surface 
receptor is the serotonin 2B receptor. 

5 72. The method of claim 63, wherein the signal 

transducer protein is Protein Kinase-C-a-type . 

73. The method of claim 64, wherein the tumor suppressor 
protein is adenomatosis polyposis coli tumor 

10 suppressor protein. 

74. The method of claim 64, wherein the tumor suppressor 
protein is the colorectal mutant cancer protein. 

15 75 • Th e method of claim 52, wherein the cytoplasmic 

protein contains the amino acid sequence SLGI, 
wherein each - represents a peptide bond, each 
parenthesis encloses amino acids which are 
alternatives to one other, and each slash within 

20 such parentheses separating the alternative amino 

acids . 

76. The method of claim 52, wherein the cytoplasmic 
protein is Fas-associated phosphatase- 1 . 

25 

77. A method inhibiting the proliferation of cancer 
cells comprising the composition of claim 1 . 

78. The method of claim 77, wherein the cancer cells are 
30 derived from organs comprising the thymus, liver, 

kidney, colon, ovary, breast, testis, spleen, 
stomach, prostate, uterus, skin, head and neck. 

79. The method of claim 77, wherein the cancer cells are 
35 derived from cells comprising T-cells and B-cells. 



80. 



A method of inhibiting the proliferation of cancer 
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cells comprising the composition of claim 25. 

81. The method of claim 80, wherein the cancer cells are 
derived from organs comprising the thymus, liver, 
kidney, colon, ovary, breast, testis, spleen, 
stomach, prostate, uterus, skin, head and neck. 

82. The method of claim 80, wherein the cancer cells are 
derived from cells comprising T-cells and B-cells. 



83. A method of inhibiting the proliferation of cancer 
cells comprising the compound identified by the 
method of claim 27. 

15 84. The method of claim 83, wherein the cancer cells are 

derived from organs comprising the thymus, liver, 
kidney, colon, ovary, breast, testis, spleen, 
stomach, prostate, uterus, skin, head and neck. 

20 85. The method of claim 83, wherein the cancer cells are 

derived from cells comprising T-cells and B-cells. 

86. A method of inhibiting the proliferation of cancer 
cells comprising the compound identified by the 

25 method of claim 52. 

87. The method of claim 86, wherein the cancer cells are 
derived from organs comprising the thymus, liver, 
kidney, colon, ovary, breast, testis, spleen, 

30 stomach, prostate, uterus, skin, head and neck. 

88. The method of claim 86, wherein the cancer cells are 
derived from cells comprising T-cells and B-cells. 



35 



89. 



A method of treating cancer in a subject which 
comprises introducing to the subject's cancerous 
cells an amount of the composition of claim 1 



10 



15 
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effective to result in apoptosis of the cells. 

90. The method of claim 89, wherein the cancer cells are 
derived from organs comprising the thymus, liver, 
kidney, colon, ovary, breast, testis, spleen, 
stomach, prostate, uterus, skin, head and neck. 

91. The method of claim 89, wherein the cancer cells are 
derived from cells comprising T-cells and B-cells. 

92. A method of treating cancer in a subject which 
comprises introducing to the subject's cancerous 
cells an amount of the composition of claim 25 
effective to result in apoptosis of the cells. 

93. The method of claim 92, wherein the cancer cells are 
derived from organs comprising the thymus, liver, 
kidney, colon, ovary, breast, testis, spleen, 
stomach, prostate, uterus, skin, head and neck. 

94. The method of claim 92, wherein the cancer cells are 
derived from cells comprising T-cells and B-cells. 

95. A method of treating cancer in a subject which 
25 comprises introducing to the subject's cancerous 

cells an amount of the compound identified by the 
method of claim 27 effective to allow apoptosis of 
the cells. 

30 96. The method of claim 95, wherein the cancer cells are 

derived from organs comprising the thymus, liver, 
kidney, colon, ovary, breast, testis, spleen, 
stomach, prostate, uterus, skin, head and neck. 



20 



35 



97. 



The method of claim 95, wherein the cancer cells are 
derived from cells comprising T-cells and B-cells. 
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98. A method of treating cancer in a subject which 
comprises introducing to the subject's cancerous 
cells an amount of the compound identified by the 
method of claim 52 effective to result in apoptosis 
5 of the cells. 



99. The method of claim 98, wherein the cancer cells are 
derived from organs comprising the thymus, liver, 
kidney, colon, ovary, breast, testis, spleen, 
10 stomach, prostate, uterus, skin, head and neck. 



100. The method of claim 98, wherein the cancer cells are 
derived from cells comprising T-cells and B-cells. 

15 101- A method of inhibiting the proliferation of virally 

infected cells comprising the composition of claim 
1. 



102. A method of inhibiting the proliferation of virally 
20 infected cells comprising the composition of claim 

25. 



103. A method of inhibiting the proliferation of virally 
infected cells comprising the compound identified by 
25 the method of claim 27. 



104. A method of inhibiting the proliferation of virally 
infected cells comprising the compound identified by 
the method of claim 52. 

30 

105. The method of claim 101, wherein the virally 
infected cells comprise Hepatitis B virus, Epstein- 
Barr virus, influenza virus, Papilloma virus. Adeno 
virus, Human T-cell lymphtropic virus, type 1 or 

35 HIV. 



106. The method of claim 102, wherein the virally 
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infected cells comprise Hepatitis B virus, Epstein- 
Barr virus, influenza virus, Papilloma virus. Adeno 
virus, Human T-cell lymphtropic virus, type 1 or 
HIV. 

5 

107. The method of claim 103, wherein the virally 
infected cells comprise Hepatitis B virus, Epstein- 
Barr virus, influenza virus, Papilloma virus. Adeno 
virus, Human T-cell lymphtropic virus, type 1 or 

10 HIV. 

108. The method of claim 104, wherein the virally 
infected cells comprise Hepatitis B virus, Epstein- 
Barr virus, influenza virus, Papilloma virus. Adeno 

15 virus, Human T-cell lymphtropic virus, type 1 or 

HIV. 

109. A method of treating a virally-inf ected subject 
which comprises introducing to the subject's 

20 virally- infected cells the composition of claim 1 

effective to result in apoptosis of the cells. 

110. A method of treating a virally-inf ected subject 
which comprises introducing to the subject's virally 

25 infected cells the composition of claim 25 effective 

to result in apoptosis of the cells. 

111. A method of treating a virally-inf ected subject 
which comprises introducing to the subject's 

30 virally-inf ected cells an amount of the compound 

identified by the method of claim 27 effective to 
result in apoptosis of the cells. 

112. A method of treating a virally-inf ected subject 
35 which comprises introducing to the subject's 

virally- infected cells an amount of the compound 
identified by the method of claim 52 effective to 
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result in apoptosis of the cells. 

113. The method of claim 109, wherein the virally 
infected cells comprise the Hepatitis B virus, 

5 Epstein-Barr virus, influenza virus, Papilloma 

virus. Adeno virus, Human T-cell lymphtropic virus, 
type 1 or HIV. 

114. The method of claim 110, wherein the virally 
10 infected cells comprise the Hepatitis B virus, 

Epstein-Barr virus, influenza virus, Papilloma 
virus. Adeno virus, Human T-cell lymphtropic virus, 
type 1 or HIV. 

15 115. The method of claim 111, wherein the virally 

infected cells comprise the Hepatitis B virus, 
Epstein-Barr virus, influenza virus, Papilloma 
virus. Adeno virus, Human T-cell lymphtropic virus, 
type 1 or HIV. 

20 

116. The method of claim 112, wherein the virally 
infected cells comprise the Hepatitis B virus, 
Epstein-Barr virus, influenza virus, Papilloma 
virus. Adeno virus, Human T-cell lymphtropic virus, 

25 type 1 or HIV. 

117. A pharmaceutical composition comprising the 
composition of claim 1 in an effective amount and a 
pharmaceutically acceptable carrier. 



30 



118. A pharmaceutical composition comprising the 
composition of claim 25 in an effective amount and 
a pharmaceutically acceptable carrier. 



35 



119. 



A pharmaceutical composition comprising the compound 
identified by the method of claim 27 in an effective 
amount and a pharmaceutically acceptable carrier. 
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120. A pharmaceutical composition comprising the compound 
identified by the method of claim 52 in an effective 
amount and a pharmaceutically acceptable carrier. 
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FIG. 3A 
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FIG. 3B 
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FIG. 4C 
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FIG. 7C 
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FIG. 7H 

1 maaaeydqll kqvealkmen snlrqeledn snhltklete asnmkevlkq lqgsieceaar. 
61 assgqidlla rlkelnldss nfpgvklrsk nslrsygere gsvssrsgec spvprcgsfpr 
121 rgSvagsres tgylaeleke rsllladldk eekekdwyya qlqaltkrid slpltanfsl 
181 qtdmtrrqle yearqirvar. eaqigteqdn ekraqrriar iqqiakdilr irqllqsqa* 
241 ea«66qr.kh etgshdaerq negqgvgain natsgngqgs ttmfcetae vlssssthsa 
301 prrltshlgt kvarvysllB xnlgchdkddni srtllamsss qdscisnurqs gelplliqll 
361 hgndxdsvll gxisrgskear arasaalhni ihsqpddkrg rrairvlhli eqiraycatc 
421 wewqeahepg mdqdtapmpa pvebqicpav cvlmklsfde ehriiainnelg glqaiaellq 
481 vdcemyglts dhysislrry ag»alfcnltf gdvaskatlc snkgcaralv aqllcsesedl 
541 cqviasvlrn lswradvnsk ktlrevgsvk alneealevk kastlksvls alwnlsahcs 
601 eckadicavd galaflvgtl tyreqtntla iieagggilr nvseliatne dhrqilrann 
661 clqtLlqhlk shsltivsna cgtlwnlsar npkdqaalvd mgavsnlknl ihekhkaiax 
721 gsa&airola aarpakykda niaepgsslp slhvrkqkal aaeldaqhls etffdaidnls 
781 pkaahrekqr hkqslygdyv fdtnrhddnr sdafiatgraat vlspylnttv Ipssassrgs 
841 ldasrsekdr sl«r«rgigl gnyhpatenp gtsskrglqi sttaaqiakv neevsaihcs 
901 qedrssgstt elfccvcdem alrrssaaht hsntynftks e=enrtc3mp yakleykr3s 
961 ndslnsvsss dgygkrgqmk psiesysedd eekfesygqy padlahklhs arJmddndge 
1021 ldcpiayslk ysdeqlnsgr qspaqnerwa rpkhiiedei kqseqrqsrx qsttypvyte 
1081 stddkhlkfq phfgqqecvs pyrsrgaags etnrvgenbg isqavsqsle qeddyeddkp 
1141 tiyeexysee eqfceeeexpc nysikyneek rhvdqpidys Ikyatdipss qkqsfsfaks 
1201 ssgassktah me assents ^ pgsnakrqaq Ihpssaqsrs gqpqkaatek vssinqetiq 
1261 tycvedtpic fsresslssl ssaedeigen qttqaadsan tlqiaeikek igtrsaedpv 
1321 sevpavsqhp rtkasrlqgs elesesarWc avefssgaks psksgaqtpk sppahyyqet 
1381 plmferctsv ssldsfesrs iasevqsepc sgnvsgiisp sdlpdspgqt mppsrextpp 
1441 pppctaqtkr evpknkapta ekxesgpkqa avaaavqrvq vlpdadtllh f atescpdgf 
1501 scsssisals ldepfiqjcdv elriappvqe &dng£«taee qpkesaeaqa keaektidse 
1561 kdilddaddd dieileecii sanptkeark akkpaqtask lpppvarkps qlpvykllps 
1621 qarlqpqkhv eftpgddtrpr vyevegtpin fietateledl tiesppnela agegvrggaq 
1681 sgefekrdti ptegrstdea qggktsevti pelddnkaee gdilaecins aspkgkehkp 
1741 frvkkiadqv qqasaseeap nteqldgkkk kptspvkpip qnteyrtrvr taadsknnln 
1801 aervfsdnkd skkqnlknns kdfndklpaa edrvrgsfaf dephhytpie gcpyefsrad 
1861 slSBldfddd dvdlssekae lrkakankee eakvtshtel tsnqqsaakt qaiakqpinr 
1921 gqpkpilqkq stfipqsskdi pdxgaatdtk Iqafaientp vcfshaasls slsdidqenn 
1981 nkenepikec eppdsqgeps kpqasgyapk sfhvedtpve fsrnsslssi eideeddllq 
2041 eciseaapkk kkpsrlkgdn ekhsprtaogg ilgedUldl kdiqrpdseh glspdaaafd 
2101 wkaiqegans lveslhqaaa aaclsrqass dsdailslks gislgspfU. tpdqeekpft 
2161 ankoprilkp gekstletkk ies«skgikg gkkvyksllt gkvrsnseis gqfflkqplqan 
2221 spsisrgrtm lhlpgvrnss sstspvskkg pplktpasks psegqtatte prgakpsvxe 
2281 elspvarqts qiggsskaps rsgsrdstps zpaqqplszp iqspgrnsis pgxngisppn 
2341 klsqlprtas p6tastkssg sgtacaytspg rqasqqnltk qtgisknaas iprsesasjcg 
2401 lnqasogcga nkkvalsxns stkssgsesd rsarpvlvrq stf ikaapep ttlrrkleesa 
2461 efeslspssr pasptreqaq tpvlspslpd malstheevq aggwrklppr. leptieyndg 
2S21 rpakrhdiar shaeepsrlp inrsgtwkxe hskhssslpr vetwrrcgss ssilsasses 
2581 sekakaedak hvaaisgtkq skeaqvsakg twrkikenef sptnstsqtv 3sgatBgaas 
2641 ktliyqjaapa vsktadvwvr iedcpiaapr sgreptgatp pvidsvseka npnikdskda 
2731 qakqnvgngs vpmrtvglea rlnefiqvda pdqkgteikp gqanpvpvse snesaivert 
2761 pfsssssskh aspegtvaar vtpfnynpep rkssads?sa rpsqiptpvn nntkkrdekc 
2821 dstassgtqs pkrhsgsylv fiJDt 
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